(12) INIIERNATIOMAL APFUCATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCX) 



(19) World Intellectual Property Organixatioii 
Jniematjonal B\ireau 

(43) Internatiomial Publication Date 
26 January 2006 (26.01.2006) 




PCT 



iiilllliiililiil 



(10) International Publication Number 

wo 2006/009888 A2 



(51) InternationaJ Patent ClassUicetion: 

GOIN 33/53 (2006.01) C07K 7/08 (2006.01) 
GOIN 33/543 (2006.01) 

(Zl) InternaUansd Application Muiuber; 



(22) International Filing Bate: 

(25) Filing Language: 

(26) Publication Liiogwage: 



PCT^3S2005/021558 
17 June 2005 (17.06.2005) 
English 
EngHsh 



(30) Priority Oata: 

10/871,602 
10/957.351 



17 June 2004 (17.06.2004) US 
30 September 2004 (30,09 2004) US 



(71) Applicant (for all dfisigmted States except USy. AVIDIA 
RESEARCH INSTFTUTK [USAJS]; 2450 Bayshore 
Parkway. Moutain View, CA 94043 (US). 

(72) Inventors; and 

(75) Invcntors/AppBcanls (for US only): STEMMER, 
Willcm, ?,y C, [NL/US]; 108 Kalhy Court, Los Gaios. 
CA 95030 (US). PERI^ROTH, D., Victor [US/US]; 2345 
Cornen Strcei, Palo AUo. CA 94306 (US). SATVAL, 
Sanjcev [IN/US]; 3320 Brittan Avenue, #1, San Carlos, 
CA 94070 (US). ALBA, Benjamin, M. fUSAJS]; 1245 
i Cresiwood Dri-ve, South San Francisco, CA 94080 (US), 
\ BAKKER, Alice [USAJS]; 1234 Bubb Road, Cupertino, 
\ CA 95014 (IJS). DUGUAY, Amy, N. [US/US]; 1 Elmhurst 
I Drive, San Francisco, CA 94132 (US). LIU, Qiang 
; [USAJS]; 55 WJIiams Lane, Foster City, CA 94404 (US), 
! SitVERNUN, Joshua [USAJS}; 856 Georgetown Place, 
: San Jose, CA 95126 (US). SMITH, Richard [GBAJSj; 
I 113 Sierra Visla Avenue, Apt H, Mountain View, CA 



94043 (US)- SWIMMERj Candacc [USAJS]; 
CaroHna Street, SaJi Francisco, CA 94107 (US). 



1064 



(74) Agents: HINSCH, Matthew, et al.; Townsend and 
Townscnd and Cr^w LLP, Two Embarcadero Center, Sih 
Hoor, San Francisco, CA 941 11-3834 (US). 

($1) Designated States (unless otherwise indicated, for every 
kind of national protection available)'. AB, AG, AL, AM, 
AX AU, AZ, B A, BB. BG, BR, BW, BY, BZ, CA, CH, CN, 
CO, CK, CU. CZ, DE, DK. DM, DZ, EC, EE, EG, ES. Fl, 
GB, GD. GB, GH, GM, liR, IIU, ID, IL, IN, IS, JP, KB, 
KG, KM, KP, KR, KZ, LC, LK, LK, LS, LT, LU, LV. MA, 
MD, MG, MK. MN, MW, MX, MZ, NA, NG, NT, NO, NZ. 
OM, PG. PH. PL, PT, RO. RU. SC, SD, SB, SO, SK, SL, 
SM. SY, TJ, TM, TN, TR, TT, TZ. U A, UG, US, UZ, VC, 
VN, YU. ZA,ZM, ZW. 

(84) Designated Stales (mkss otherwise indimed. for every 
kind of regional protection available): ARIPO (BW, GH, 
GM, KB, LS, MW, MZ, NA. SD, SL, SZ, TZ, UG, ZM. 
ZW), Eurasian (AM, AZ, BY. KG, K2, MD, RU, TJ, TM), 
European (Af, BE. BG, CH, CY. CZ, DE. DK, BE, ES, Fi, 
FR, GB, OR, IIU, IB, IS, IT, LT, LU, MC, NL, PL. PT, RO, 
SB, ST, SK, TR), OAPT (BP, BJ, CF, CG, CI, CM, GA, GN, 
GQ, GW, ML, MR, NE, SN, TD. TG), 

Declaration under Rule 4.17; 

— of inventorship (Rule 4 J 7(iv}) for US only 

Published; 

— without Intentaiional search report and to be republished 
upon receipt of thai report 

For iwo-imer codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing ai tlw be^in- 
ning of each regular issue of the FCT Gazette, 



(54) Title: C^MET KINASE Bim)TNG PROTONS 



< 

GO 
QC 
QO 
OS 

o 

o 
o 




□ RM12 
• M26 

A cMetFc 



log [inhibitorj nW! 



^ (57) Abstract: PolypepJidos comprising monomer domains thai bmd lo c^MET, or portions thereof, arc provided. 



wo 2006/009S88 



PCT/US2005/G2155« 



cMT KINASE BINDING PROTEINS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
[Oil This application is a continuation-in-part of U.S. Patent Application 
No. 10/957,351, filed September 30, 2004, which is a continuation-in-part of U.S. Patent 
Application No. 10/871^602, filed June 17, the disclosures of each of which are incorporated 
by reference in their entbety for all pnrposcs. 

BACKGROUND OF THE INVENTION 

(02] Hepatocyte Growth Factor/Scatter Factor (HGF/SF) is a mesenchyme- 
derived pleiotropic fe^tor, which regulates cell growth, cell motility, and morphogenesis of 
varioxis types of cells and mediates epithelial-mesenchymal interactions responsible for 
morphogenic tissue interactions during embryonic development and organogenesis. 
Although HGF was originally identified as a potent mitogen for hepatocytes, it has also been 
identified as an angiogenic growth factor. 

{031 Met was first identified in the 1 980s as an oncogene and is the receptor 
for HGF. The proto-oncogene c-MET, was found to encode a receptor tyrosine kinase. In 
response to HGF treatment a range of activities are observed: phosphorylation of receptor, 
docWng of signaling intermediates Gab-1/Grb2, culminating in activation of kinases such as 
PI3K, ERKl and 2, and AKT. These activities aid in cell growth, survival, migration, and 
neovascularisation. 

[04] Inappropriate expression or signaUng of the receptor tyrosine kinase 
Met and its ligand Hepatocyte Growth Factor/Scatter Factor {HGF/SF) is associated with an 
aggressive phenotype and poor clinical prognosis for a wide variety of solid human tumors. 

[05] Four lines of evidence cement the case for a role of c-MET in cancer: 

[06] First, mouse and human cell lines that ectopically overexpress HGF 
and/or Met become tmnorigenic and metastatic in athymic nude mice. Secondly, 
downregulation of Met or HGF expression in human tumour cells decreases their tmnorigenic 
potential. Mouse models that express tlie receptor or Ugand as a transgene develop various 
types of tumour and metastatic tumors. Third, a large number of studies show that HGF 
and/or Met are firequently expressed in carcinomas, in other types of human solid tumours 
and in their metastases, and that HGF and/or Met over- or misexpression often correlates with 
poor prognosis. Fourth, unequivocal evidence that implicates Met in human cancer is 
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provided by the activating mutations that have been discovered in both sporadic and inherited 
forms of human renal papillary carcinomas, 

BRIEF SUMMARY OF TBE INVENTION 
5 [07} The present invention provides a polypeptide comprising a monomer 

domain that binds to c-MET. In some embodiments^ the monomer domain: 

is a non-naturally-occurring monomer domain consisting of 30 to 50 amino acids; 
comprises at least one disulfide bond; and optionally, 
binds to an ion. 

1 0 f 08] In some embodiments^ the monomer domain is an LDL receptor class 

A monomer domain, hi some embodiments, the monomer domain is an LDL receptor class 
A monomer domain comprising the following sequence: 
EFXCXNGXCIPXXWXCDGXDDCGDXSDE, 
wherein X is any amino acid* 
] 5 [09] to some embodiments, the polypeptide comprises at least one and no 

more than six monomer domains that bind c-MET. In some embodiments, the polypeptide 
comprises at least two monomer domains that bind c-MET. 

[10] In some embodbnents, the polypeptide further comprises a second 
monomer domain, wherein the second monomer domain has a binding specificity for a blood 
20 factor, thereby increasing the serum haltlife of the polypeptide when the polypeptide is 
injected into an animal compared to the serum half-life of a polypeptide lacking the blood 
factor-binding monomer domain. In some embodiments, the blood factor is serum albumin, 
an immunoglobulin or an erythrocyte. In some embodiments, tfie second monomer domain 
binds to immunoglobulin (IgG) and the second monomer domain is an LDL receptor class A 
25 monomer domain comprising a sequence selected from the following: 
CXSSGRCIPXXWVCDGXXDCRDXSDE, and 
CXSSGRCPXXWLCDGXXDCRDXSDE, 
wherein X is any amino acid. 

[11] In some embodiments, the second monomer domain binds to 
30 immunoglobulin (IgG) and the second monomer domain is an LDL receptor class A 
monomer domain comprising the following sequence: 

[EQ]FXCRX[ST]XRC[IV]XXXW[ILV]CDGXXDCXD[DN]SDE, 

wherein X is any amino acid and amino acids in brackets are alternative amino acids 

at a single position. In some embodiments, the second monomer domain comprises 
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CHPTGQFRCRSSGRCVSPTWVCDCiUMUUUiJiNtiDEliNCSAPASEPPGSL, In 
some embodiments, the second mononner domain comprises 
CHPTGQFRCRSSGRCVSPTWVCDGDNDCGDNSDEENC 

[12] In some embodiments, binding of at least one monomer domain to c- 
5 MET inhibits dimerization of Met. In some embodiments, at least one monomer domain 
binds to the Sema domain of c-MET^ thereby preventing binding of Met Hgands to c-MET. 

{13] In some embodiments, the polypeptide comprises at least one and no 
more than six monomer domains. In some embodiments, the polypeptide comprises at least 
two monomer domains and the monomer domains are linked by a linker. In some 
1 0 embodiments, the linlcer is a peptide linker. In some embodiments, the linker is between 4 to 
12 amino acids long. 

[14] k. some embodiments^ the monomer domains are each between 35 to 
45 amino acids. 

[15] In some embodiments, each monomer domain comprises two disulfide 
1 5 bonds. In some embodiments, each monomer domain comprises three disulfide bonds. 

[16] In some embodiments^ the ion is a metal ion- In some embodiments, 
the ion is a calcium ion, 

[17] In some embodiments, at least one of the monomer domains is derived 
from a LDL-receptor class A domain. In some embodiments, at least one of the monomer 
20 domains is derived from an EGF-Iike domain- 

[18] In some embodiments, the monomer comprises an amino acid 
sequence in which at least 10% of the amino acids in the sequence are cysteine; and/or at 
least 25% of the amino acids are non-natarally-occurring amino acids. 

(19] Th^ present invention also provides methods for identifying a 
25 polypeptide that binds to c-MET. In some embodiments, the method comprises, 
screening a library of polypeptides for affinity to c-MET; and 
selecting a polypeptide comprising at least one monomer domain that binds to c- 
MET, wherein the monomer domain: 
is a non-naturally-occurring monomer domain; 
30 comprises, at least one disulfide bond; and 

binds to an ion. 

[20] In some embodiments, the selected polypeptide comprises a monomer 
domain comprising any of the following; 

Cxy^x [EQ] FxCkSTxRC [IV] xxxWxCDGDNDCEDxSDEx 
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Cxxxx lEQ] FECxSTxRC [IVJ xxxWxCUGxNDCEDxSDEx 
Ckxxx [EQ] FxCxSTxRC ULV] Pxx^KfxCDGxxDCSDxSDHlxx 
Cxxx [EQ3 FQCxSTxRC [IV] PxxWxCDGxNDCEDSSDExxC 
Cxxxx [EQ] FxCxxxxxC [ILV] xxxxxxxxxxDCxDxSDEx 
5 Cxxx [EQ] FxCxSTGRCxPxxWxCxGxNDCEDxSDEx 

Cxxxx [EQJ FxCxSTxRCEILVjxxxWxCxxxxDCxDxSDxxxxxCx 
Cxxx [EQ] FxCxxxxxC [ ILV] xxxHxCDGxNDCxDxSxBxxxxC 
Cxxxx [EQ] FxCxSTxRC [ ILV] PxxV^xCxGxxDCxDxSDEx 
Cxxxx [EQ] FxCxxxxxC [ILV] xxxV^xCDGxxDCxDxSDEx 
10 EFXCXNGXCIPXXWXCDGXDDCGDXSDE . 

[21| In some embodimentSj the selecting step comprises selecting a 
polypeptide that reduces HGF-mediated cell proliferation and/or migration. In some 
embodiments^ the method further comprises selecting a polypeptide that inhibits trnnor 
growth in an atiimal, 

15 [22| In some embodiments, the monomer domain comprises an amino acid 

sequence in which at least 10% of the amino acids in. the sequence are cysteine; and/or at 
least 25% of the amino acids are non-naturaily-occurring amino acids, 

[23] In some embodiments^ the method further comprises 
linking the monomer domain in the selected polypeptide to a second monomer 
20 domain to form a library of multimersj each multimer comprising at least two 

monomer domains; 

screening the library of multimers for the ability to bind to c-MET; and 
selecting a mul timer that binds c-MET. 

[24] In some embodiments, the me&od forther comprises 
25 linking the monomer domain in the selected polypeptide to a second monomer 

domain to form a library of multimers, each multimer comprising at least two 
monomer domains; 

screening the library of multimers for the ability to bind to a target molecule other 
than the c-MET; and 
30 selecting a multimer that binds to the target molecule. 

[25] In some embodiments, the method ftirther comprises a step of mutating 
at least one monomer domain, thereby providing a library comprising mutated monomer 
domains. 
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[26] In some embodiments, the library ot monomer domains is expressed as 
a phage display^ ribosome display or cell surface display. 

[27J In some embodiments, the polypeptide comprises at least two 
monomer domains and the monomer domains are linked by a linker. In some embodimentSj 
5 the linker is a peptide linker. In some embodiments, the linker is between 4 to 1 2 amino 
acids long. 

[28] In some embodiments, the monomer domains are each between 35 to 
45 amino acids. 

[29] In some embodiments, each monomer domain comprises two disulfide 
10 bonds, in some embodiments^ each monomer domain comprises three disulfide bonds. 

[30] In some embodiments, the ion is a metal ion. In some embodiments, 
the ion is a calcium ion. 

[31] In some embodiments, at least one of the monomer domains is derived 
from a LDL-receptor class A domain. In some embodiments^^ at least one of the monomer 
15 domains is derived from an EGF-Hke domain. 

[32] In some embodiments, the monomer domain comprises an amino acid 
sequence in which at least 10% of the amino acids in the sequence are cysteine; and/or at 
least 25% of the amino acids are non-naturally-occurring amino acids. 

[33] The present invention also provides polynucleotides encoding a 
20 polypeptide comprising a monomer domain that binds to c-METj wherein the monomer 
domain: 

is a non-naturally-occnrring monomer domain consisting of 30 to 50 amino acids; 

comprises at least one disulfide bond. 

The present invention also provides 
25 [34] A polypeptide comprising a monomer domain that binds to 

immunogiobulin~G (IgG), wherein the monomer domain is an LDL receptor class A 
monomer domain comprising sequence selected fi:om the following: 

CXSSGRCIPXXWVCDGXXDCRDXSDE, 

CXSSGRCIPXXWLCDGXXDCRDXSDE, mid 
30 [EQ]FXCRX[ST]XRC[IV]XXXW[ILV]CDOXXDCXD[DN]SDE 

wherein X is any amino acid and amino acids in brackets are alternative amino acids 

at a single position; and 
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wherein the polypeptide has an irtcreasea semm ixalHite when the polypeptide is 
injected into an animal compared to the sermn haJf-life of a polypeptide lacking the 
monomer domain that binds to IgG. 

[35J In soroe embodiraentSj the monomer domain comprises 
5 CHPTGQFRCRSSGRCVSPTWVCDGDNDCGDNSDEENCSAPASEPPGSL. In some 
embodiments, the monomer domain comprises 
CHPTGQFRCRSSGRCVSPTWVCDGDNDCGDNSDEEKC. 

[361 ^ Kome embodiments, the polypeptide comprises a second monomer 
domain with binding specificity for a molecnle other than IgG, wherein the second monomer 
10 domain: 

has between 30-100 amino acids; 

is a non-natmraily-occurring monomei' domain; 

comprises at least one disulfide bond. 

[37| In some embodiments, the second monomer domain is a non- 
1 5 naturalJy-occnring LDL-receptor class A domain. 

[38] The present invention also provides polynncleotides that encode the 
polypeptides as described above, 

DEFINITIONS 

20 [39] Unless otherwise indicated, the following definitions supplant those in 

the art. 

{40] "Met" also referred to as "c-MET^" refers to the Hepatocyte Growth 
Factor/Scatter Factor (HGF/SF)"binding receptor tyrosine kinase. In response to HGF 
treatment a range of activities ai'e observed: phosphorylation of receptorj docking of signaling 
25 intermediates Gab-1/Grb2, culminating in activation of kinases such as PI3K, BRKl and 2, 
and AKT. These activities aid in cell growth, survival^ migration, and neovascularisation. 
See, e.g., Birchmeier et al.Mol Cell Biol 4:915-925 (2003), The amino acid sequence of 
Met is known and is displayed in SEQ ID NO:L See^ e.g.. Park et al.Proc, Natl Acad. Set 
84(1 8):6379 (1987). 

30 [4 1) The tenns "monomer domain" or '^monomer" are used interchangeably 

and herein refer to a discrete region found in a protein or polypeptide. A monomer domain 
forms a native three-dimensional structure in solution in the absence of flanking native amino 
acid sequences. Monomer domains of the invention will often bind to a target molecule. For 
example, a polypeptide that forms a three-dimensional structure that binds to a target 
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molecule is a monomer domain. As used herein, tne term "monomer domain" does not 
encompass the complementarity determining region (CDR) of an antibody, 

[42] The term "loop" refers to that portion of a monomer domain that is 
typically exposed to the environment by the assembly of the scaffold structure of the 
5 monomer domain protein, and which is involved in target binding. The present invention 
provides three types of loops that are identified by specific features, such as^ potential for 
disulfide bonding^ bridging between secondary protein structures, and molecular dynamics 
(ix,j flexibility). The three typ^ of loop sequences are a cysteine-defined loop sequence^ a 
structure-defined loop sequence^ and a B-factor-defined loop sequence. 

10 {43 j As used herein, the term "cysteine-defined loop sequence" refers to a 

subsequence of a naturally occurring monomer domain-encoding sequence that is bound at 
each end by a cysteine residue ^-hat is conserved with respect to at least one other naturally 
occurring monomer domain of the same family. Cysteine-defined loop sequences are 
identified by multiple sequence alignment of the naturally occurring monomer domains^ 

1 5 followed by sequence analysis to identify conserved cysteine residues. The sequence 
between each consecutive pair of conserved cysteine residues is a cysteine-defined loop 
sequence. The cysteine-defined loop sequence does not include the cysteine residues 
adjacent to each temiinus. Monomer domains having cysteine-defined loop sequences 
include the LDL receptor A-domainS:^ EGF-hke domains^ sushi domains, Fibronectin type 1 

20 domains, and the like. Thus, for example, in the case of LDL receptor A-domains 

represented by the consensus sequence^ CX6CX4CX6CX5CXSC, wherein X^, X4, X5, and Xg 
each represent a cysteine-defined loop sequence comprising the designated number of amino 
acids, 

[44] As used herein^ the temi "structtire-defined loop sequence" refers to a 
25 subsequence of a monomer-domain encoding sequence that is bound at each end to 

subsequences that each fonn a secondary structure. Secondary structures for proteins with 
known three dimensional structures are identified in accordance wiflx the algorithm STRIDE 
for assigning protein secondary structure as described in Frishman^ D, and Argos, P. (1995) 
"Knowledge-based secondary structure assignment/ Proteins, 23(4):566-79 (see also 
30 //hgmp.mrc.ac.uk/Registered/Option/stride.html at the World Wide Web). Secondaiy 
structures for proteins with unknown or uncharacterized three dimensional structures are 
identified in accordance with the algorithm described in Jones, D.T. (1999), "Protein 
secondary structure prediction based on position-specific scoring matiices/' J. Mol. BioL . 
292:195-202 (see also McGuffin, Bryson, K., Jones, DT, (2000) "The PSIPRED protein 
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structoe prediction server/^ Sloia&iBMcs. lb:4U4-405, and //bioinfxs.uclaauk/psipred/ at 
the World Wide Web). Secondary structures include, for example, pleated sheets, helices, 
and the like. Examples of monomer domains having struotee-defined loop sequences are the 
C2 domains, Ig domams. Factor 5/8 C domains, Fibronectin type 3 domains, and fee like, 

[45] The term "B-factor'-defined loop sequence^* refers to a subsequence of 
at least three amino acid residues of a monomer-domain encoding sequence in which the B- 
factors for the alpha carbons in the B^factor-defeied loop are among the 25% highest alpha 
carbon B factors in the entire monomer domain. Typically the average alpha-carbon B^-factor 
for the subsequence is at least about 65. As used herein, the term ^'B-factor" (or "temperature 
factor" or "Debye- Waller factor") is derived from X-ray scattering data. The B-factor is a 
factor that can be applied to the X-ray scattering term for each atom, or for groups of atoms> 
that describes the degree to which electron density is spread out B-factors employed in the 
practice of the present invention may be either isotropic or anisotropic. The term "average 
alpha-carbon B-factor" refers to: 

n 

(S B"factorcai)/n 

where n corresponds to the number of residues in the loop, and is at least 3, and B-factorcai is 
the B"factor for the alpha carbon of amino acid residue i of the loop. 

{46] The term "multimer" is used herein to indicate a polypeptide 
comprising at least two monomer domains. The separate monomer domains in a multimer 
can be joined together by a linker. A multimer is also known as a combinatorial mosaic 
protein or a recombinant mosaic protein. 

[47] The tenn "family" and '^family class'' are used interchangeably to 
indicate proteins that are grouped together based on similarities in their amino acid 
sequences. These similar sequences are generally conserved because they are important for 
the function of the protein and/or the maintenance of the three dimensional structure of the 
protein. Examples of such families include the LDL Receptor A-domain family, the EGF- 
like family, and the like. Additionally, related sequences that bind to the same target 
molecule can be divided into families based on common sequence motifs. 

[48] The term "ligand " also referred to herein as a "target molecule^' 
encompasses a wide variety of substances and molecules, which range from simple molecules 
to complex targets. Target molecules can be proteins, nucleic acids, lipids, carbohydrates or 
any other molecule capable of recognition by a polypeptide domain. For example, a target 
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molecule can include a chemical compound (i.e., aon-biological compoxmd such as^ e.g., an 
organic molecule^ an inorganic molecule, or a molecule having both organic and inorganic 
atomsj but excluding polynucleotides and proteins), a notixture of chemical compounds, an 
array of spatially localized compounds, a biological macromoleculej a bacteriophage peptide 

5 display library, a polysome peptide display library^ an extract made from a biological 

materials such as bacteria, plants, fungi, or animal (ag., mammalian) cells or tissue, a protein, 
a toxin, a peptide hormone, a cell, a virus, or the like* Other target molecules include, e,g,, a 
whole cell, a whole tissue, a mixture of related or unrelated proteins, a mixture of viruses or 
bacterial strains or the like. Target molecules can also be defined by inclusion in sa-eening 

10 assays described herein or by enhancing or inhibiting a specific protem interaction (i*e., an 
agent that selectively inhibits a binding interaction between two predetermined polypeptides). 

[491 The term '"linker" is used herein to indicate a moiety or group of 
moieties that joins or connects two or more discrete separate monomer domains. The linker 
allows the discrete separate monomer domains to remain separate when joined together in a 

1 5 multimer. The linker moiety is typically a substantially Hnear moiety. Suitable linkers 

include polypeptides, polynucleic acids, peptide nucleic acids and the like. Suitable linkers 
also include optionally substituted alkylene moieties that have one or more oxygen atoms 
incorporated in the carbon backbone. Typically, the molecular weight of the linker is less 
than about 2000 daltons. More typically, fhe molecular weight of the linker is tes than about 

20 1500 daltons and usually is less than about 1000 daltons. The linker can be small enough to 
allow the discrete separate monomer domains to cooperate, e.g., where each of the discrete 
separate monomer domains in a multimer binds to the same target molecule via separate 
binding sites. Exemplary linlcers include a polynucleotide encoding a polypeptide, or a 
polypeptide of amino acids or other non-naturally occurring moieties. The linker can be a 

25 portion of a native sequence, a variant thereof, or a synthetic sequence. Linkers can 

comprise, e.g., naturally occurring, non-naturally occurring amino acids, or a combination of 
both. 

[SO] The term ^'separate" is used herein to indicate a property of a moiety 
that is independent and remains independent even when complexed with other moieties, 
30 including for example, other monomer domains, A monomer domain is a separate domain in 
a protein because it has an independent property that can be recognized and separated from 
the protein. For instance, the ligand binding ability of the A-domain in the LDLR is an 
independent property. Other examples of separate include the separate monomer domains in 
a multimer that remain separate independent domains even when complexed or joined 



9 



wo 2006/009888 PCT/US2M5/021558 

together in Uiemisltimer by a linker. Another example of a separate property is the separate 
binding sites in a multimer for a ligand. 

[51] As used herein, "directed evolution" refers to a process by which 
polynucleotide variants are generated, expressed, and screened for an activity (e.g., a 
polypeptide with binding activity) in a reciarsiYe proce^. One or more candidates in the 
screen are selected and the process is then repeated using polynucleotides that encode the 
selected candidates to generate new variants. Directed evolution involves at least two rounds 
of variation generation and can include 3, 4, 5, 10, 20 or more rounds of variation generation 
and selection. Variation can be generated by any method known to those of skill in the art, 
including, e.g., by error-prone PCR, gene recombination, chemical mutagenesis and the like. 

(52] The term "shuffling" is used herein to indicate recombination between 
non-identical sequences. In some embodiments, shuffling can include crossover via 
homologous recombination or via non-homologous recombination, such as via cre/lox and/or 
flp/fit systems. Shuffling can be carried out by employing a variety of different formats, 
including for example, in vitro and in vivo shuffling formats, in silico shufQing formats, 
shuffling formats that utilize either double-stranded or single-sbanded templates, primer 
based shuffling fonnats, nucleic acid fragmentation-based shuffling formats, and 
oligonucleotide-mediated shuffling formats, all of which are based on recombination events 
between non-identical sequences and are described in more detail or referenced herein below, 
as well as other similar recombination-based formats. The term "random" as used herein 
refers to a polynucleotide sequence or an amino add sequence composed of two or more 
amino acids and constructed by a stochastic or random process. The mdom polynucleotide 
sequence or amino acid sequence can include framework or scaffolding motife, which can 
comprise invariant sequences. 

[53] The term "pseudorandom" as used herein refers to a set of sequences, 
polynucleotide or polypeptide, that have limited variability, so that the degree of residue 
variability at some positions is limited, but any pseudorandom position is allowed at least 
some degree of residue variation, 

[54] The terms "polypeptide," "peptide," and "protein" are used herein 
interchangeably to refer to an amino acid sequence of two or more amino acids. 

[55] The tenn "amino acid" refers to naturally occurring and synthetic 
amino acids, as well as amino acid analogs and amino acid mimetics that function in a 
manner similar to the naturally occurring amino acids. Naturally occurring amino acids are 
those encoded by the genetic code, as well as those amino acids that are later modified, e.g., 
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hydroxyproHne, y-carboxyglutamate, and O-phosphoserme. Amino acid analogs refers to 
compounds that have the same l>asic chemical structure as a naturally occurring amino acid^ 
Le., an a carbon that is bound to a hydrogen^ a carboxyl group, an amino group, and an R 
group, e.g,, homoserine, norieucinej methionine sulfoxide, methionine methyl sulfonium, 
5 Such analogs have modified R groups (e.g. , norleucine) or modified peptide backbones, but 
retain the same basic chemical structure as a naturally occurring amino acid, "Amino acid 
mimetics" refers to chemical compounds that have a structure that is different from the 
general chemical structure of an amino acid, but that fanctions in a manner similar to a 
naturally occurring amino acid. 
1 0 [56] "Conservative amino acid substitution" refers to the interchangeability 

of residues having similar side chains. For example, a group of amino acids having aliphatic 
side chains is glycine, alanine, valine, leucine, and isoleucine; a ^oup of amino acids having 
aliphatic-hydroxyl side chains is sexine and threonine; a ^oup of amino acids having amide- 
containing side chains is asparagine and glutamine; a group of amino acids having aromatic 
1 5 side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic 
side chains is lysine, arginine, and histidine; and a group of amino acids having sulfer- 
containing side chains is cysteine and methionine, Prefeited conservative amino acids 
substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-ar^nine, 
alanine-valine, and asparagine-glutamine. 
20 157] The phrase ^^nucleic acid sequence" refers to a single or double- 

stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5* to fhe 3* 
end or an analog thereof. 

[58] The term ^'encoding'' refers to a polynucleotide sequence encoding one 
or more amino acids. The term does not require a start or stop codon. An amino acid 
25 sequence can be encoded in any one of si:?c different reading frames provided by a 
polynucleotide sequence. 

[59] The term "promoter" refers to regions or sequence located upstream 
and/or downstream from the start of transcription that are involved in recognition and binding 
of RNA polymerase and other proteins to initiate transcription. 
30 [60] A "vector" refers to a polynucleotide, which when independent of the 

^ host chromosome, is capable of replication in a host organism. Examples of vectors include 
plasmids. Vectors typicallyhave an origin of replication. Vectors can comprise, e.g., 
transciiption and translation terminators^ transcription and translation initiation sequencesj 
and promoters nseful for regulation of the expression of the particular nucleic acid, 

11 
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[61] The term 'Vecombinanf wnen usea witn reterence, e.g., to a cell, or 
mcleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has 
been modified by the introduction of a heterologoiis niicleic acid or protein or the alteration 
of a native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, 
5 for example, recombinant cells express genes that are not found within the native 

(nonrecombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under-expressed or not expressed at alL 

[62] The phrase "specifically (or selectively) binds" to a polypeptide^ when 
referring to a monomer or multimer, refers to a binding reaction that can be determinative of 

1 0 the presence of the polypeptide in a heterogeneous population of proteins (e.g., a cell ox tissne 
lysate) and other biolo^cs. Thns, under standard conditions or assays used in antibody 
binding assays^ the specified monomer or mnltimer binds to a particular target molecule 
above background (e.g., 2X, 5X^ 1 OX or more above background) and does not bind in a 
significant amount to other molecules present in the sample. 

1 5 [63] The terms "identical" or percent "identity/' in the context of two or 

more nucleic acids or polypeptide sequences, refer to t^o or more sequences or subsequences 
that are the same, "Substantially identical" refers to two or more nucleic acids or polypeptide 
sequences having a specijRed percentage of amino acid residues or nucleotides that are the 
same (Le,, 60% identity, optionally 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity over a 

20 specified region^ or, when not specif! edj over the entire sequence), when compared and 
aligned for maximum correspondence over a comparison window, or designated region as 
measured using one of the following sequence comparison algorithms or by manual 
alignment and visual inspection. Optionallyj the identity or substantial identity exists over a 
region that is at least about 50 nucleotides in length, or more preferably over a region that is 

25 100 to 500 or 1000 or more nucleotides or amino acids in length, 

[64] A polynucleotide or amino acid sequence is "heterologous to" a second 
sequence if the two sequences are not linked in the same manner as found in naturally- 
occurring sequences. For example, a promoter operably linked to a heterologous coding 
sequence refers to a coding sequence which is different from any naturally-occurring allelic 

30 variants. The term "heterologous linlter," when used in reference to a multimer, indicates 
that the multimer comprises a linker and a monomer that are not found in t!ie same 
relationship to each other in nature {e.g., they form a non-naturally occurring fusion protein). 

[65] A "non-naturally-occurring amino acid" in a protein sequence refers to 
any amino acid other than the amino acid that occurs in the corresponding position in an 
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alignment with a naturany-occurring poiypeptiae witn the lowest smallest sxm probability 
where the comparison window is the length of the raonomer domain queried and when 
compared to a naturally-occuring sequence in the non-redundant ("nr") database of Genbank 
using BLAST 2.0 as described herein. 

5 [66] ""'Percentage of sequence identity" is determined by comparing two 

optimally aligned sequences over a compaiison window^ wh^ein the portion of the 
polynucleotide sequence in the comparison window may comprise additions or deletions (ie. , 
gaps) as compared to the reference sequence (which does not comprise additions or deletions) 
for optimal alignment of the two sequences. The percentage is calculated by determining the 

1 0 number of positions at which the identical nucleic add base ox amino acid residue occurs in 
both sequences to yield the number of matched positions^ dividing the number of matched 
positions by the total number of positionr in the window of comparison and multiplying the 
result by 100 to yield the percentage of sequence identity. 

[67] The terms "identical'* or percent '*identity,^' in the context of two or 

15 more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences 
that are the same or have a specified percentage of amino acid residues or nucleotides that are 
the same, when compared and aliped for maximum correspondence over a comparison 
window, or designated region as measured using one of the following sequence comparison 
algorithms or by manual alignment and visual inspectiom Such sequences are then said to be 

20 "substantially identical.'' This definition also refers to the complement of a test sequence, 
Optionallyj the identity exists over a region that is at least about 50 amino acids or 
nucleotides in lengthy or more preferably over a region that is 75-100 amino acids or 
nucleotides in length, 

[68] For sequence comparison^ typically one sequence acts as a reference 

25 sequence, to which test sequences are compared. When using a sequence comparison 

algoritfuHj test and reference sequences are entered into a computer, subsequence coordinates 
are designated j if necessary^ and sequence algorithm program parameters are designated. 
Default program parameters can be used, or alternative parameters can be designated. The 
sequence comparison algorithm then calculates the percent sequence identities for the test 

30 sequences relative to the reference sequence, based on the program parameters. 

[69] A "comparison window''^ as used herein, includes reference to a 
segment of any one of the number of contiguous positions selected from the group consisting 
of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in 
which a sequence may be compared to a reference sequence of the same number of 
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coatiguo\is positions after the two seqxieuces are optimally aliped. Methods of alignment of 
sequences for comparison are well-known in the art. Optimal alignment of sequences for 
comparison can be conducted, e.g., hy the local homology algorithm of Smith and Watennan 
(1970) Adv. Appl Matk 2:482c, by &e homology aligmnent algorithm of Needleman and 
Wmsch (1 970) J. Mol Biol 48:443, by the search for similarity method of Pearson and 
Lipman (1 988) Proc. Nafl Acad. Scl USA 85:2444, by computerized implementations of 
these algoriihms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual 
alignment and visual inspection (see, e.g., Ausubel etaL, Current Protocols in Molecular 
Biology (1995 supplement)). 

[70i One example of a useful algorithm is the BLAST 2.0 algoxilhm, which 
is described in Altschul et al (1990) J. Mol. Biol 215:403-410, respectively. Software for 
performing BLAST analj^es is publicly available through the National Center for 
Biotechnology Information Oittp://www.ncbi.nlm,mh.gov/). This algorithm involves first 
identifying high scoring sequence pairs (HSFs) by identifying short words of length W in the 
query sequence, which either match or satisfy some positive-valued threshold score T when 
aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word 
hits act as seeds for mitiating searches to find longer HSPs containing them. The word hits 
ai^e extended in both directions along each sequence for as far as the cumulative aiignment 
score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the 
parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score 
for mismatching residues; always < 0). For amino acid sequence, a scoring matrix is used to 
calculate the cumulative score. Extension of the word hits in each direction are halted when: 
the cumulative aiignment score falls off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine tine sensitivity and speed of the alignment The BLASTN 
program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an expectation 
(E) or 10, M^5, N===-4 and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the 
BLOSUM62 scoring matrix {see Henikoff and Henikoff (1989) Proc. Natl Acad. Sci. USA 
89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N--4, and a comparison of both 
strands. 
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[71] The BLAST algorithm also pertbrxns a statistical analysis of the 
similarity between two sequences (see, e.g., Kariin and Altschul (1 993) Proa Natl Acad. ScL 
USA 90:5873-5787). One measure of similarity provided by the BLAST algorithm is the 
smallest sum probability (P(N)X which provides an indication of the probability by which a 
5 match between two nucleotide or amino acid sequences would occur by chance. For 
example, a nucleic acid is considered similar to a reference sequence if the smallest sum 
probability in a comparison of the test nucleic acid to the reference nucleic acid is less than 
about 02, more preferably less than about 0.0 1 ^ and most preferably less than about 0,00 1 . 



10 BRIEF DESCRIPTIOTS! OF THE DRAWINGS 

[72J Fi^e 1 schematically illustrates the alignment of partial amino acid 
sequence from a variety of the LDL-receptor class A-domains to demonstrate the conserved 
cysteines. The connectivity of cysteines in the three disulfide bonds of the folded domain is 
illustrated schematically on the consensus sequence. Residues whose side-chains contribute 

15 to calcium binding are designated with an asterisk in the consensus sequence. 

[73] Figure 2, panel A schematically illustrates an example of an A-domain. 
Panel A schematically illustrates conserved amino acids in an A-domain of about 40 amino 
acids long. The conserved cysteine residues are indicated by and the conserved negatively 
charged amino acids are indicated by a circle with a minus ("-") sign. Circles with an '"H" 

20 indicate conserved hydrophobic residues. Panel B schematically illustrates two folded A- 
domains connected via a linker. Panel B also indicates two calcium binding sites, dark 
circles with Ca^^^ and three disulfide bonds within each folded A-domain for a total of 6 
disulfide bonds, 

(74] Figure 3 indicates some of the Ugands recognized by naturally- 
25 occurring members of the LDL-receptor family, which include inhibitors^ proteases, protease 
complexes^ vitamin-carrier complexes, proteins involved in lipoprotein metabolism, non- 
hmnan ligands, antibiotics, viruseSj and others, 

[75] Figure 4 schematically illustrates a general scheme for identifying 
monomer domains that bind to a ligand, isolating the selected monomer domains, creating 
30 multimers of the selected monomer domains by joining the selected monomer domains in 
various combinations and screening the multimers to identify multimers comprising more ' 
than one monomer that binds to a liganA 

[76] Figure 5 is a schematic representation of another selection strategy 
(guided selection), A monomer domain with appropriate binding propeilies is identified firom 
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a library of monomer domains. The identified monomer domain is then linked to monomer 
domains firoixi another library of monomer domains to form a library of multimers* The 
mnltimer library is screened to identify a pair of monomer domains that bind simultaneoxisly 
to the target. This process can then be repeated until the optimal binding properties are 
5 obtained in the multimer. 

[77] Figure 6 depicts an ahgnment of A domains. At the top and the bottom 
of the figure, small letters (a-q) indicate conserved residues. 

[78] Figure 7 illustrates various possible antibody-monomer or multimer 
confomiations. In some embodimentSj the monomer or multimer replaces the Fab fi-agment 
iO of the antibody. 

|79J Figure 8 depicts a possible conformation of a multimer of the invention 
comprising at least one monomer domain fliat binds to a half-life extending moletaie and 
other monomer domains binding to one or optionally two or more target molecules, hi the 
FigurCj two monomer domains bind to two first target molecules. Optionally, the two 
15 monomer domains can bind to different sites on one first target molecule (not depicted), 

|80] Figure 9 shows a comparison between c-METFc, a c-MET-specific 
monoiaer (M26) and a c-MET-specific dimer (RM12; RecM12) with regards to their relative 
abilities to block HGF-induced proliferation of serum-starved A549»SC human lung 
adenocarcinoma cells, 

20 (81] Figure 1 0 illustrates the serum half-life in monkeys of monomers that 

bind to IgG. 

DETAILED DESCRIPTION OF THE INVENTION 
L INTRODUCTION 

25 [82] The present invention provides for non~nati3rally-occurring proteins 

that bind to c-MET, Generally^ the proteins of the present invention comprise a domain that 
binds to c-MET, These domains may be readily identified using a variety of polypeptide 
scaffolds to generate a plurality of polypeptide valiants and then selecting a variant that binds 
to c-MET. The present invention therefore also provides for selecting a protein that binds to 

30 c-MET. Proteins that bind c-MET are useful^ e,g., for treating mdividuals with solid tumors 
that express c-MBT, The polypeptides of the invention are also useM to detect tissues in 
which Met is expressed and can be used to target molecules to those tissues. 

[83] c-MET is inactive in its resting monomer state and dimer formation 
results in receptor activation (often even in absence of ligand binding). The mature form of 
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the receptor consists of a solely extracellular a chain and a longer (3 chain encompassing the 
r^ainder of the extracellular dommi}, a transmembrane domain and a cytoplasmic tail The 
cytoplasmic tail contains the jnxtamembrane domain^ a Idnase domain and docldng sites for 
signaling intennediates. The a chain and the Srst 212 amino acids of the p chain^ also known 

5 as the Sema domain (ICong-Beltran, et al , Cancer Cell 6:75-84 (2004), are sufficient for 
binding to HGF. The rest of the extracellnlar portion of the p chain consists of a cysteine- 
rich C domain and fonr repeats of an unusual immunoglobnlin domain. Accordingly, in some 
embodiments J the polypeptides of the invention comprise at least one monomer domain that 
inhibits dimerization of c-MET a and p chains and/or ftinctions as an antagonist to prevent 

1 0 ligands of c-MET from binding and/or activating c-MET, 

[84] While the present invention provides for polypeptides comprismg 
single domains, mnltimers of the domains may also be synthesized and used. In some 
embodiments^ all of the domains of the mnltimer bind c-MET. hi some of these 
embodiments^ each of the domains are identical and bind to the same portion (i.e., "epitope") 

15 of c-MET, For example, in some embodimentSj the monomer domains bind to the Seana 
domain of c-MET. hi other embodiments, at least some of the domains in the mnltimer bind 
to different portions of c-MET, hi yet other embodimentSj at least some of the domains of 
the polypeptide bind to a molecule or molecules other than c-MET (e.g. ^ a blood factor such 
as serum albumin, immunoglobulin, or erythrocytes), 

20 

//, MONOMERS 

[85] Monomer domains can be polypeptide chains of any size. In some 
embodimentSj monomer domains have about 25 to about 500, about 30 to about 200, about 
30 to about 100, about 35 to about 50, about 35 to about 100, about 90 to about 200, about 30 

25 to about 250, about 30 to about 60, about 9 to about 150, about 100 to about 150, about 25 to 
about 50, or about 30 to about 150 amino acids. Similarly, a monomer domain of the present 
invention can comprise, e.g., from about 30 to about 200 amino acids; j&om about 25 to about 
1 80 amino acids; from about 40 to about 150 amino acids; from about 50 to about 130 amino 
acids; or from about 75 to about 125 amino acids. Monomer domains can typically maintain 

30 a stable confonnation in solution, and ai"e often heat stable, e.g., stable at 95^ C for at least 10 
minutes without losing binding affinity. Sometimes, monomer domains can fold 
independently into a stable conformation. In one embodiment, the stable confonnation is 
stabilized by ions (e.g.j such as metal or calcium ions). The stable conformation can 
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Optionally contain disulfide bonds (cg.^ at least one, two, or three or more disulfide bonds). 
The disulfide bonds can optionally be formed between two cysteine residues. In some 
embodiments, monomer domains^ or monomer domain variants, are substantially identical to 
the sequences exemplified. 

5 

A. c-MET Binders 

[86] In some aspects^ the invention provides monomer domains that bind to 
a c-MET polypeptide or a portion thereof A portion of a polypeptide can be, e.g., at least 5, 
lOj 15, 20, 30j 50, 100, or more contiguous amino acids of the polypeptide. 
1 0 [87] A large number of c-MET binding sequences having an A domain 

scaffold were generated. As described in detail in the examples, ten families {le.^ Families 1- 
10, or "Fam 1-10'*) of monomer domains that bind to c-MET have been identified. The 
consensus motifs generated based on these families indicate common amino acid residues 
between c-MET binders. Sequence flanking the conserved residues comprising the motif are 
15 omitted from the motifj although it is assumed that all residues comprising the A-domain 

structure will be present in any binding domain basedon the families below. Those of skill in 
the ait will appreciate that positions where there is no consensus (marked with an "X") can be 
any amino acid, hi some embodiments, the amino acid at "X" positions will be selected from 
amino acids in the analogous position of one of the exemplified c-MET binders eitiier from 
20 the same family or a different family, 

(88) Family 1 has the following consensus motif; 
CXKX [EQ]FxCxSTxRCIIVlxxKWxCDGDISlDCKD>tSDSx 
[89] Exemplary sequences comprising the c-MET Family 1 motif are 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
25 each Family 1 sequence exemplified in the examples, 

[90] Family 2 has the following motif; 
CxKXx [EQ] FECxSTxRC [ IV] kxxWxCDGxNDCEDkSDKx 
[91] Exemplary sequences comprising the c-MET Family 2 motif are 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
30 each Family 2 sequence exemplified in the examples. 

[92] Family 3 has the following motif: 

Cxxxx[BQl FxCxSTxEC [ILV] PjcxWxCDGkxDCEDkSDExk 
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[93] Exemplary seqijences compnsmg tiie c-MET Family 3 motif are 
displayed m the examples. References to c-MET binding monomers or multimers encompass 
each Fmnily 3 sequence exemplified in the examples, 

[94J Family 4 has the following motif: 
5 Cxxx [EQ] FQCxSTkRC [ XV] PxxWxCDGxNDCEDSSDExkC 

[95] Exemplary sequences comprising ffie c-MET Family 4 motif ate 
displayed in the examples* References to c-MET binding monomers or mnltimers encompass 
each Family 4 sequence exemplified in the examples. 

[96] Family 5 has the following motif: 
10 Cxxxx [ EQ] FxCxxxKxC [ILV] xxxxxxxxxxDCxDxSDSx 

[91] Exemplary sequences comprising the c-MET Family 5 motif ate 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
each Family 5 sequence exemplified in the examples, 

[98] Family 6 has the following motif: 
15 CxxK [EQ] FxCxSTGKCxPxxWxCxGxNDCEDxSDEx 

[99] Exemplary sequences comprising flie c-MET Family 6 motif are 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
each Family 6 sequence exemplified in the examples, 

(100] Family 7 has the following motif: 
20 Cxxxx EEQ) FkCxSTxRC [ILV] xxxWxCxxxxDCxDxSDxxxxxCx 

[101] Exemplary sequences comprising the c-MET Family 7 motif are 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
each Family 7 sequence exemplified in the examples. 

(102] Family 8 has the following motif; 
25 Ckxx [EQ] FxCxxXKxC [ILV]xxxWxCDGxNDCxDxSxExxxxC 

[103] Exemplary sequences comprising the c-MET Family 8 motif are 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
each Family 8 sequence exempHfied in the examples. 

[104] Family 9 has the following motif: 
30 Cxxxx [EQ] FxCxSTxRC [ILVl PxxWxCxGxxDCxDxSDEx 

[105] Exemplary sequences comprising the c-MET Family 9 motif are 
displayed in the examples. References to c-MET binding monomers or multimers encompass 
each Family 9 sequence exemplified in the examples, 

[106] Family 1 0 has the following motif: 
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Cxxxx [EQ] FxCxxxKxC [ILV] xxxWxCDGxxDCxDxSDEx 
which can be further condensed as: 

EFXCXNGXCIPXXWXCDGXDDCGDXSDE 

[107} Exemplary sequences comprising the c-MET Family 1 0 motif are 
5 displayed in the examples. References to c-MET binding monomers or multimers encompass 
each Family 10 sequence exemplified in the examples. 



B, IgG binders and serum hall-life extensioB 

[108J The invention fiulher provide monomer domains that bind to a blood 
1 0 factor {e.g.y serum albumin;, immunoglobulin^ or erythrocytes). 

[109] In some embodiments^ the the monomer domains bind to an 
immunoglobulin polypeptide or a portion thereof* 

[llOJ Two famihes {i.e., A domain Families 2 and 3) of monomer domains 
that bind to immunoglobulin have been identified. 
15 1111] Family 2 has the following motif: 

[EQj FXCRX [ST] XRC ( IV] XXXW [ILV] CDGXXDCXD [DN] SDE 
[112] Exemplary sequences comprising the IgG Family 2 motif are displayed 
in the examples. References to IgG binding monomers or multimers encompass each Family 
2 sequence exemplified in the examples. 
20 [113] Family 3 has either of the two following motifs: 

CXSSGRCIPXXWVCDGXXDCRDXSDE; or 

CXSSGRCIPXXWLCDGXXDCRDXSDE 

[114] Exemplary sequences comprising the IgG Family 3 motif are displayed 
in the examples. References to IgG binding monomers or multimers encompass each Family 
25 3 sequence exemplified in the examples. 

[115J Monomer domains that bind to red blood cells (RBC) or serum 
albumin (CSA) are described in U.S. Patent Publication No, 2005/0048512, and include, 
e.g.,: 

RBCA CRSSQFQCNDSRICIPGRWRCDGDNDCQDGSDETGCGDSHILPFSTPGPST 
30 RBCB CPAGEFPCKNGQCLPVTWLCDGVlSiDCLDGSDEKGCGRPGPGATSAPAA 
RBCll CPPDEFPCKNGQCIPQDWLCDGVNDCLDGSDEKDCGRPGPGATSAPAA 
CSA-A8 CGAGQFPCKNGHCLPLNLLCDGVNDCEDNSDEPSELCKALT 

1116) The present invention provides a method for extending the serum half- 
life of a protein, includingj e.g.^ a multimer of the invention or a protein of interest in an 
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animaL The protein of interest can be any protein with therapeutic, prophylactic, or 
otherwise desirable functionality. This method comprises first providing a monomer domain 
that has been identified as a binding protein that specifically binds to a half-life extender such 
as a blood-carried molecule or cell^ such as serum albumin (e^g,, human serum albumin), IgG^ 
5 red blood cells, etc. The half-life extender-binding monomer is then covalently linked to 
another monomer domain that has a binding affinity for the protein of interest (e^g.^ c-MET or 
a different target). This complex formation results in the half-Ufa extension protecting the 
multimer and/or bound protein(s) from proteolytic degradation and/or other removal of the 
multimer and/or protein(s) and thereby extending the half-Ufe of the protein and/or multimer, 

1 0 One variation of this use of the mvention includes the half-life extender-binding monomer 
covalently linked to the protein of interest. The protein of interest may include a monomer 
domain^ a multimer of monomer domains, or a synthetic drug. Altematively, monomers that 
bind to either immunoglobulins or erythrocytes could be generated using the above method 
and could be used for half-life extension. 

1 5 tll7I The half-life extender-binding multimers are typically multimers of at 

least two domains, chimeric domainSj or mutagenized domains (i.e., one that binds to Met 
and one that binds to the blood-carried molecule or cell). Suitable domains include all of 
those described herein, that are further screened and selected for binding to a half-life 
extender. The half-life extender-binding multimers are generated in accordance with the 

20 methods for making multimers described herein^ usingj for example^ monomer domains pre- 
screened for half-life extender -binding activity. The serum half-life of a molecule can be 
extended to be, e.g., at least 1, 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70 80, 90, 100, 150, 200, 250, 
400, 500 or more hours. 

25 C. Discussion of Monomer Doiijains 

[118] Monomer domains that are particularly suitable for use in the practice 
of the present invention are cysteine-rich domains comprising disulfide bonds, Cysteine-rich 
domains employed in the practice of the present invention typically do not form an a helix, a 
p sheet, or a p^barrel structure. Typically, the disulfide bonds promote folding of the domain 

30 into a three-dimensional structure. Usually, cysteine-rich domains have at least two disulfide 
bonds, more typically at least three disulfide bonds. In some embodiments, at least 5^ 10, 15 
or 20% of the amino acids in a monomer domain are cysteines. 
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[119] Domains can have any Bumber of characteristics. For example^ in 
some embodiments, the domains have low or no immunogenicity in an animal (e.g., a 
human). Domains can have a small size, hi some embodiments, the domains are small 
enough to penetrate skin or other tissues. Domains can have a range of in vivo half-lives or 
5 stabilities, 

[120] Illnsttative monomer domains suitable for use in the practice of the 
present invention include^ e.g.^ an EGF-like domain, a Kringle-domainj a fibronectin type I 
domain, a fibronectin type 11 domain^ a fibronectin type III domain, a PAN domain, a Gla 
domain, a SRCR domain, a Kimitz/Bovine pancreatic trypsin Inhibitor domain, a Kazal-type 

10 serine protease inhibitor domain, a Trefoil (P-type) domain, a von Willebrand factor type C 
domain, an Anaphylatoxin-like domain, a CUB domain, a thyroglobulin type I repeat, LDL- 
receptor class A domain, a Sushi domain, a Link domain, a Thrombospondin type I domain, 
an hnmnnoglobulin-like domain, a C4ype lectin domain, a MAM domain, a von Willebrand 
factor type A domain, a Somatomedin B domain, a WAP-type four disulfide core domain, a 

1 5 F5/8 type C domain, a Hemopexin domain, an SH2 domain, an SH3 domain, a Laminin-type 
EGF domain, a C2 domain, and other such domains known to those of ordinary skill in the 
art, as well as derivatives and/or variants thereof 

[121] In some embodiments, suitable monomer domains (e.g, domains with 
the ability to fold independently or with some limited assistance) can be selected from the 

20 families of protein domains that contain p-sandwich or p-barrel three dimensional structures 
as defined by such computational sequence analysis tools as Simple Modular Architecture 
Research Tool (SMART), see Shultz et al, SMART: a web-based tool for the study of 
genetically mobile domains, (2000) Nucleic Add s Res e arch 28(1 ):23 1-234) or CATH (see 
Pearl etML, Assigning genomic sequences to CATH, (2000) Nucleic Acids Research 

25 28(1):277"282), 

[122] In another embodiment, monomer domains of the present invention 
include domains other than a fibronectin type III domain, an anticalin domain and a Ig~Iike 
domain fi^om CTLA-4. Some aspects of these domatas are described in WOOl/64942 entitled 
"Protein scaffolds for antibody mimics and other binding proteins" by Lipovsek et al., 

30 published on September 7, 2001, W099/16873 entitled *'Anticalins" by Best^ et al, 

published April 8, 1999 and WO 00/60070 entitled "A polypeptide structure for use as a 
scaffold" by Desmet, et aL, published on October 12, 2000. 
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[123] As described supra, monomer domaitis are optionally cysteine rich. 
Suitable cysteine rich monomer domains include^ e.g., the L0L receptor class A domain C*A- 
domain") or the EGF domain. The monomer domains can also have a cluster of negatively 
charged residues. 

5 [124) Other features of monomer domains can include the ability to bind 

ligands or the ability to bind an ion (e.g*, Ca^"^ binding by ttie LDL receptor A-domain), 
Monomer domaitis that bind ions to maintain their secondary structure include, e.g., A 
domain, EGF domain, EF Hand (e.g., such as those found in present in calmodulin and 
ti'Oponin C), Cadherin domain, C-type lectin, C2 domain, Annexing Gla-domain, 

10 Trombospondin type 3 domain, all of which bind calcium, and zinc fingers (e.g., C2H2 type 
C3HC4 type (RING finger), Megrase Zinc binding domain, PHD finger, GATA zinc finger, 
FYVE zinc finger, B-box zinc finger), which bind zinc. Without interding to limit the 
invention, it is believed that ion-binding provides stability of secondary structure while 
providing sufficient flexibility to allow for numerous binding conformations depending on 

1 5 primary sequence. 

[125J As described herein, monomer domains may be selected for the ability 
to bind to targets other than the target that a homologous naturally occurring domain may 
bind. Thus, in some embodiments, the invention provides monomer domains (and multimers 
comprising such monomers) that do not bind to the target or the class or family of target 

20 proteins that a substantially identical naturally occurring domain may bind. 

[126] Characteristics of a monomer domain can include the ability to fold 
independently and the ahihty to form a stable structure. Thus, the structure of the monomer 
domain is often conserved, although the polynucleotide sequence encoding the monomer 
need not be conserved. For example, the A-domain structure is conserved among the 

25 members of the A-domain family, while the A-domain nucleic acid sequence is not. Thus, 
for example, a monomer domain is classified as an A-domain by its cysteine residues and its 
affinity for calcium, not necessarily by its nucleic acid sequence. See^ Figures 1 and 2. 

(127) Specifically, the A-domains (sometimes called "complement-type 
repeats" or "LDL receptor type or class A domains") contain about 30-50 or 30-65 amino 

30 acids. In some embodiments, the domains comprise about 35-45 amino acids and in some 
cases about 40 amino acids. Within the 30-50 amino adds, there are about 6 cysteine 
residues. Of the six cysteines, disulfide bonds typically are found between the following 
cysteines: CI and C3, 02 and C5, C4 and C6. The cysteine residues of the domain are 
disulfide lixiked to form a compact, stable, functionally independent moiety. 5ee, Figure 3. 
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Clusters of these repeats make up a ligand binding domain, and differential clustering can 
impart specificity with respect to the ligand binding. 

[128] Exemplary A domain sequences and consensus sequences are depicted 
in Figures 1 and % One typical consensus sequence uscM to identify A domains is the 

5 following: 0[VILMA]-X(5rC-[DNH]-X(3)-[DENQHT]^C-X(3,4)»[STADE]" [DEH]"[DE]" 
X(i^5)-Cj where the residues in brackets indicate possible residues at one position. "X(#^)" 
indicates number of residues. These residues can be any amino acid residue. Parentheticals 
containing two numbers refers to the range of amino acids that can occupy that position (e.g,, 
"[DE]-X(i^5)-C" means that the amino acids DE are followed by 1, 2, 3^ 4^ or 5 residues, 

10 followed by C). This consensus sequence only represents the portion of the A domain 

begimiing at the third cysteine. A second consensus is as follows: C'"X(3.(5)-C"-X(445)"C"X^6- 
7rC"[N,D]-X(3)-[D3.N,Q,H,S,T]-C^X(4-6rD-E-X(2-&rC. The second consensus predicts 
amino acid residues spanning all six cysteine residues. In some embodiments, A domain 
variants comprise sequences substantially identical to any of the above-described sequences. 

15 Note that reference to "LDL recq)tor class A" domain^ for the purposes of this inventionj is 
not intended to indicate origin or binding properties of the domain, 

[129] Additional exemplary A domains include the following sequence: 

CaXs-i sCbXj. \ 5C<;X6-7Q(D?N)X4CeX4.(5DEX2^8Cf 

wherein C is cysteinej Xu_ri represents between n and m nixmber of 
20 independently selected amino acids, and (D,N) indicates feat the position can be either D or 
N; and wherein Cg-Cc, Cb-Ce and Cd-Cf form disulfide bonds. 

£130 J To date, at least 1 90 naturally-occurring human A-domains are 
identified based on cDNA sequences, See^ e.g.^ Figure 6. Exemplary proteins containing 
naturally-occurring A-domains include^ e,g.^ complement components (e.g., C6, C7j C8, C9, 
25 and Factor I)^ serine proteases (e.g., enteropeptidase, matriptase, and corin), transmembrane 
proteins (e.g., ST7, LRP3, LRP5 and LRP6) and endocytic receptors (e,g., Sortilin-related 
receptor, LDL-receptorj VLDLR, LKPl, LRP2, and ApoER2). A domains and A domain 
variants can be readily employed in the practice of the present invention as monomer 
domains and variants thereof. Fiirther description of A domains can be found in the 
30 following publications and references cited therein: Howell and Hertz, Tiie LDL receptor 
gene family: signaling junctions during development^ (2001) Current Opinion in 
Neurobiologjy 11:74-81; Herz (2001), supra\ Krieger, The ''best'' of cholesterols, the ''worst'' 
ofcholesterols: A tale of two receptors, (1998) PNAS 95: 4077-4080; Goldstein and Brown, 
The Cholesterol Quartet, (2001) Science, 292: 1310-1312; and, Moestrup and Verroust, 

24 
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Megalin-and Cubilin-Mediated Endocytosis ofProiein-Bound Vitamins, Lipids, and 
Hormones in Polarized Epithelia, (2001) Aim. Rev.Niitr, 21:407-28. 

[131 J A number of other domain types can also be used to generate c-MET- 
binding monomer domains. 
5 [132] Exemplary EGF monomer domains include the sequence: 

CaX3_i4CbX3-7CcX4„j6CdXi.2C^S-23Cf 

wherein C is cysteine, Xn-m represents between n and m number of 
independently selected amino acids; and 

wherein Ca-Cc, Cb-Ce and Q-Cf form disulfide bonds. 

1 0 [133J Each of the domains described below employ exemplary motifs {I e. , 

scaffolds). Certain positions are marked x, indicating that any amino acid can occnpy the 
position. These positions can inclnde a number of different amino acid possibilities, thereby 
allowing for sequence diversity and thus affinity for different target molecules. Use of 
brackets in motifs indicates alternate possible amino acids within a position (e.g., "[^^9]'' 

1 5 indicates that either E, K or Q may be at that position). Use of parentheses in a motif 
indicates that that the positions within the parentheses may be present or absent (e.g., 
"([ekq])" indicates that the position is absent or either K, or Q may be at that position). 
When more than one '^x'* is used in parentheses (e.g., "(xx)'0, each x represents a possible 
position. Thus "(xx)" indicates that zero, one or two mmxo acids may be at that position(s), 

20 where each amino acid is mdependently selected from any amino acid, a represents an 
aromatic/hydrophobic amino acid such as, e.g., W, Y, F, or L; p represents a hydrophobic 
amino acid such as, e.g., Y, I, L, A, M, or F; % represents a smallor polar amino acid such as, 
e.g., A, S, or T; 5 represents a charged amino acid such as, e.g. , K, E, or D; s 
represents a small amino acid such as, e.g.,; V, A, S, or T; and ^ represents a negatively 

25 charged amino acid such as, e.g., D, or N. 

[134] Suitable domains include, e.g. thrombospondin type 1 domains, trefoil 
domains, and thyroglobulin domains. 

[135] Thrombospondin type 1 ("TSPl") domains contain about 30-50 or 30- 
65 amino acids. In some embodiments, the domains comprise about 35-55 amino acids and 

30 in some cases about 50 amino acids. Within the 35-55 amino acids, there are typically about 
4 to about 6 cysteine residues. Of the six cysteines, disulfide bonds typically are found 
between the following cysteines: CI and C5, C2 and C6, C3 and C4. The cysteine residues 
of the domain are disulfide linked to form a compact, stable, functionally independent moiety 
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comprising distorted beta strands. Clusters of these repeats make up a ligand binding 
domain^ and differential clustering can impart specificity with respect to the ligand binding, 
(136] Exemplary TSPl domain sequences and consensus sequences are as 

follows: 

5 ( 1 ) (xxxxxx)Ci xxxC2Xxxxx(x)xxxxxC3Xxxx(xxx)xxxxxC4Xxxxxx:(x)xxxC5(x)xxxxC6; 

(2) (wxxWxx)CixxxC2XxGxx(x)xRxxxC3XXxx(Pxx)xxxxxQxxxxxx(x)xxxC5{x)xxxxC6 

(3) (wxx Wxx)Ci sxtC2XxGxx{x)xRxrxC3Xxxx(Pxx)xxxxxC4Xxxxxx(x)xxxC5(x)xxxxC<s 
(4) 

(WxxWxx)Ci[Stnd][Vkaq][Tspl]C2Xx[Gq]xx(x)x[Re]x[Rlctvm]x[C3vld^ 
1 0 C4ldae]xxxxxx(x)xxxC5(x)xxxxC6. 
(5) 

(WjaWxx)C|[Stnd][Vkaq][Tspl3C2Xx[Gq]xx(x)x[Re3x[IUctviB]x[C3vldr]xxxx(IPq]x^^ 

C4ldae]xxxxxx(x)3CKxC5(x)xxxxC6; and 

(6) 

15 Ci[nst][aegiMqrstv][adenpqrst]C2[adetgs]xgx[ikqrstv3x[aqrst]x[alnuiY]x 
xx)C4XXXXxxx:':x(xx)C5XXXxC6 

[137] In some embodiments^ thi-ombospondin type 1 domain variants 
comprise sequences substantially identical to any of the above-described sequences. 

20 (138] To date, at least 1 677 naturally occurring thrombospondin domains 

have identified based on cDNA sequences. Exemplary proteins containing the naturally 
occurring thrombospondin domains include^ e.g., proteins in the complement pathway {e.g.y 
properdin, C6, C7, CSA^ C8B, and C9), exti^acellular matrix proteins (e,g,, mindin, 
spondiUp SCO-spondin^X circumsporozoite surface protein 2, and TRAP proteins of 

25 Plasmodium, Thrombospondin type 1 domains are further described in^ e,g.^ Rosmusz et al, 
BBRC 296:1 56 (2002); Higgins et al, J Immunol 155:5777^85 (1995); Schultz^Cherry et at, 
IBiol C^em,270:7304--7310(1995); Schultz-Cherry Biol Chem, 269:26783-8 

(1994); Bork, FEBSLett 327:125-30 (1993); and Leung-Hagesteijn et al. Cell 71:289-99 
(1992). 

30 [139] Another exemplary monomer domain suitable for use in the practice of 

the present invention is the trefoil domain. Trefoil monomer domains are typically about 
about 30-50 or 30-65 amino acids. In some embodiments^ the domains comprise about 35-55 
amino acids and in some cases about 45 amino acids. Within the 35-55 amino acidSj there 
are typically about 6 cysteine residues* Of the six cysteines, disulfide bonds typically are 

35 found bet-ween the following cysteines: CI and C5, C2 and C4j C3 and C6. 

[140] To date, at least 149 naturally occurring trefoil domains have 
identified based on cDNA sequences. Exemplary proteins containing naturally occurring 
trefoil domains include, e,g., protein pS2 (TFFl), spasmolytic peptide SP (TFF2X intestinal 
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trefoil factor (TFF3)^ intestinal surcease-isomaltase, md proteins which may be involved in 
defense against microbial infections by protecting the epithelia {e.g.,Xenopus xPl, xP4, 
integumentary mncins A.1 and C.l . Trefoil domains are further described in, e,g,, Sands and 
Podolsky.^mw. Physiol 58:253-273 (1996); Cmetal.PNAS USA 91:2206-2210 
5 (1994); DeA et al, PNAS USA 91 :1084-1088 (1994); Hoffinan et aL, Trends Biochem Sci 
18:239-243 (1993), 

[141] Exemplary trefoil domain sequences and consensus sequences are as 

folio v^s: 

( 1 ) C J (xx)xxxx?i:xxxxC2Xx(x)xxxxxxxC3XXXxC4C5XXxxx(x)xxxxxC6 
1 0 (2) C| (xx)xxxxxxRxxC2Xx(x)xxxxxxxC3XxxxC4C5Xxxxx(x)xxxxxC6 
(3) C)(xx)xxxpxxRxnC2gx(x)pxitxxxC3XxxgC4Csfdxxx(x)xxxpwC(5f 
(4) 

C J (xx)xxx[P vae]xxRx[ndpm] C2[Gaiy] [ypfst] ([de]x)[pskq]x[Ivap] [Tsa]xx[qedk] C3xx[ 
krhi][Gnk]C4C5[Fwy3[Dnrs][sdpnte3xx(x)xxx[pId][Weash]C6[Fy] 
15 (5) 

Cl(xx)xxx[Pvae]xxRx[ndpm]C2[Gaiy][ypfst]([de3x)[pslcq]x[Ivap][Tsa]xx[keqd]C3x 
x[laln][Gnk]C4C5[a][Dnrs][sdpnte3xx(x)xxx[pki][Weash3C6[Fy] 
(6) 

C i ([dnps])[adiMnprsly] [dfilmv] [adenprst] [adelpnf3[ehMnqrs][ade^msv][kqr] [fiHqrt^^ 
20 ]C2[agiy][flpsvy][dknpqs3[adf^p3[aipv][st][aegkpqrs][ade^pqs][dei^ 
gknqs][gn3C4C5[wyfh][deims3[adgnpst][aefgqlrstw3[giknsvmq](^^ 
iknpv]w)C(s 

(142) Another exemplary monomer domain suitable for nse in the present 
25 invention is the thyroglobnlin domain. Thyroglobulin monomer domains are typically about 
30-85 or 30-80 amino acids. In some embodiments^ the domains comprise about 35-75 
amino acids and in some cases about 65 amino acids. Within the 35-75 amino acids, there 
axe typically about 6 cysteine residues. Of the six cysteineSj disulfide bonds typically are 
fonnd between the following cysteines: CI and 02, C3 and C4, C5 and C6, 
30 [143] To date at least 251 naturally occurring thyroglobulin domains have 

been identified based on cDKA sequences. The N-terminal section of Tg contains 1 0 repeats 
of a domain of about 65 amino acids which is Icnown as the Tg type-1 repeat 
PUBMED:3595599, PUBMED: 8 797845. Exemplary proteins containing natm^ally occurring 
thyroglobulin domains include e.g., the HLA class 11 associated invariant chain, human 
35 pancreatic carcinoma marker proteins, nidogen (entactin), insulin-like growth factor binding 
proteins (IGFBP), saxiphiliu;, chum salmon egg cysteine proteinase inhibitor, and equistatin. 
The Thyr-1 and related domains belong to MEROPS proteinase inhibitor family 13 1^ clan IX. 
Thyroglobulin domains are further described in, e.g, Molina et aL, Ew\ J. Biochem, 240:125- 
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133 (1996); Guncar et ah, £^MSO J 18:793^803 (1999); Chong and Speicher, 1)^276:5804- 
5813 (2001). 

[144] Exemplary fhyroglobulin domain sequences and consensus sequences 

are as follows: 
5 (1) 

C5XXxx(x)xxxxxxxxxxxxxx(xx)x.C6 
(2) 

Cixxxxxxxxxxxxxxx(xxxxxxxxxx)xxxxxx:s:yxPxC2XxxGxxxxxQC3x(x)x(xxx)xxxxC4 
1 0 WC5 Vxxx(x)GxxxxGxxxxxxxx(xx)xC6 

(3) CjtXxxxxxxxxxxxxxx(xxxxxxxxxx)xxxxxxxyxPxC2XxxGxyxxxQC3X(x)s(xxx^ 
Vdxx(x)GxxxxGxxxxxgxx(xx)xC6 

(4) Ci[qerl]xxxxxxxxxxxxxx(xxxxxxxxxx)xxxxxxx[Yfhp]xPxC2XxxGx[YQxx[vM]Q 
a]xxx)xx[Gsa]xC4[W)fIC5V[]L)nyfl]xx(x)Gxxxx[Gdne]xxxxxgxx(xx)xQ 

15 (5)Ci[qerl]xxxxxxxxxxxxxx(xxxxxxxxxx)xxxxxxx[ahp]xPxC2xxxGx[a3xx[vM]QC3X 
xxx)xx[gas]xC4[a]C5V[DnajXx(x)Gxxxx[^g]x.s;xxxgxx(xx)xC6 

[145] Another exemplary monomer domain that can be used in the present 

invention is a laminin-EGF domain, Laminin-EGF domains are typically about 30-85 or 30- 

80 amino acids. In some embodiments, the domains comprise about 45-65 amino acids and 

20 in some cases about 50 amino acids. Within the 45-65 amino acids, there are typically about 
8 cysteine residues which interact to form 4 disulfide bonds. Laminins are a major 
noncollagenous component of basement membranesthat mediate cell adhesion, gi-owth 
migration^ and differentiation. They are composed of distinct but related alpha, beta, and 
gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and 

25 three short globular arms. The long arm consist of a coiled coil structure contiibuted by all 
three chains and cross-linked by interchain disulphide bonds. 

[146) Exemplary laminin EGF domain sequences and consensus sequences 

are as follows; 
(1) 

30 CJXC2XXXXXX(XXX)XXC3XXX(XXXXXX)XXXXC4XC5XXXXXXXXC6XXC7XXXXXXX(XXXXX)XXX 

xxCs 
(2) 

CixC2Xxxxxx(xxx)xxC3xxx(xxxxxx)xxgxC4xC5XXxxxGxxC6XxC7Xxxxxxx(xxxxx)xx 

xxxCg 
35 (3) 

C 1 XC2 [ndh]xxxxx(xxx)xxC3Xxx(xxxxxx)xxgxC4xC5XxxxxGxxC6[denq]xC7Xx [gn] [yf 
ht]xxx(xxxxx)xxxxxC8 

In some embodimentSj the monomer domain is a Notch/LNR monomer 

domain^ a DSL monomer domain^ an Anato monomer domain, an integrin beta monomer 

40 domain, and a Ca-EGF monomer domain. 
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[147] In some embodiments, the Ca-EGF monomer domain comprises the 
following sequence: 

DxdECiXx(xx)xxxxQx(xx)xxxxxC3xNxxGxfxC4x(xxx)xC5Xxgxxxxxxx^^^ 

[148] In some emhodiments, the Notch/LNR monomer domain, comprises 
the following sequence: CiXx(xx)xxxC2XxxxxnGxC3XXxC4nxxxC5XxDGxDC5 

[149] In some embodiments, the DSL monomer domam comprises the 
following sequence: CixxxYygKxC2XxfC3XXXxdxxxhxxC4XxxGxxxC5XxGWxGxxC6. 

[150] Anato monomer domain comprises the following seqnence: 
C]C2xdgxxxxx(x)xxxxC3exxxxxxxx(xx)xxC4XxxficxC5C6. 

[151] In some embodiments, the integrin beta monomer domain comprises 
the following sequence: C)XxC2XXxxpxC3XwC4xxxxfexx(gx)xxxxRC5dxxxxLxxxgC6; 
and "x" is any amino acid. 

[152] In some embodiments, C1-C5, C2-C4 and C3-C6 of the Notch/LNR 
monomer domain form disulfide bonds; and GrCs, C2-C4 and CrCe of the DSL monomer 
domain form disulfide bonds. 

[153] In some embodiments, the Ca-EGF monomer domain comprises the 

following sequence: 

D[P][Dn]ECiXx(xx)xxxxC2[pdgl(dx)xxxxxC3xNxxG[sgt][a]xC4x(xxx)x^ 

xxxxx(xxxxx)xxxCe. 

[154] In some embodiments, the Notch/LNR monomer domain, comprises 

the following sequence: Cixx(x[P 

a])xxxC2x[(!>s]xxx[(|j][Gk]xC3[nd]x[^sa]C4[<[>s]xx[aeg]C5x[a]DGxDC6. 

[155] In some embodiments, the DSL monomer domain comprises the 
following sequence; 

Cixxx[a][ah][Gsna]xxC2Xx[a]C3Xtpae]xx[Da]xx[xl][Hrgk][ak]xC4[dnsg]xxGxxxC5XX 
G[a]xGxxC6. 

[156] In some embodiments, the Arxato monomer domain comprises the 
following sequence: 

CiC2x[Dhtl][Ga]xxxx[plant](xx)xxxxC3[esqdat]x[Rlps]xxxxxx([gepa]x)xxC4Xx[av^^^ 
qvyJxxCsC^, 

[157] In some embodiments^ the integrin beta monomer domain comprises 
the following sequence: 

CtXxC2[P]xxEghds][Pk]xC3[x][oc]C4Xxxx[a]xxx([Gr]xx)x[x]xRC5[Dnae]xxxxL[|3k]xx[G 
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njCe; a is selected from; w, y, f, and 1; P is selected ja-oni: 1, 1, a, m, and f; x is 
selected from: g, a, and t; 5 is selected from; r, and d; s is selected from; v, a, 
s, and t; and ^ is selected from: d, and n. 

[158] In some embodiments, the Ca-EGF monomer domain comprises the 
following sequence: 

D[Yiiq[Dn]ECixx(xx)xxx?tC2[pdg](dx)xxxxxC3xNxxG[sgt][fy]xC4X(xxx)xC5 as 

Jxxxxxx(xxxxx)xxxC6^ 

[159] In some embodiments, the Notch/LNR monomer domain, comprises 

the following sequence: 

Ctxx(x[yifIv])xxxC2x[dens]xxx[Nde][Gk]xC3[nd]x[densa]C4[Nsde]xx[aeg]C5x[wyfl 
DC6. 

[160] In some embodiments, the DSL monomer domain comprises the 
following sequence: 

Cixxx[YwqtYfh][Gasn]xxC2XX[Fy]C3x[pae]xx[Da]xx[glast][Hxgk][ykfw 

xxxCsxxGtWlfyJxGxxQ. 

[161] In some embodiments, the Anato monomer domain comprises the 
following sequence; 

CiC2x[adeWt]gxxxxxxxx(x)[derst]C3XXXxxxxxx(xx[aersy3)C4Xx[apyt][ftnq][^^^ 
hqrsk](x)CsC6. 

[163] In some embodiments, the integrin beta monomer domain comprises 
the following sequence: 

Ci[aegkqrst][kreqd3C2til][aelqrv][vjlas][dghs][kp]xC3[ga5t][wy]C4XXXx[ 

ar]r)C5[and][dilrt][iklpqrv][adeps][aenq]l[iklqy]x[adknr][gn]C 

[163] Poiynncleotides encoding the monomer domains are typically 
employed to make monomer domains via expression, Nncleic acids that encode monomer 
domains can be derived from a variety of different sources. Libraries of monomer domains 
can be prepared by expressing a plurality of different nncleic acids encoding naturally 
occurring monomer domains, altered monomer domains (i.e., monomer domain variants), or 
a combinations thereof. For example, libraries may be designed in which a scaffold of amino 
acids remain constant (e.g., an LDL A receptor domain, BGF domain) while the intervening 
amino acids in the scaffold comprise randomly generated amino acids, 

[164] The invention provides methods of identifying monomer domains that 
bind to a selected or desired Hgand or mixture of ligands. In some embodiments, monomer 
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domains are identified or selected for a desired property (e.g,, binding affinity) and then the 
monomer domains are formed into miiltimers. See, e.g.. Figure 4, For those embodiments, 
any method resulting in selection of domains with a desired property (e,g,, a specific binding 
property) can be used. For example, the methods can comprise providing a plurality of 

5 different nucleic acids, each nucleic acid encoding a monomer domain; translating the 
plurality of different nncleic acids, thereby providing a pteahty of different monomer 
domains; screening the plurality of different monomer domains for binding of the desired 
ligand or a mixture of ligands; and, identifying members of the plnrality of different 
monomer domains that bind the desired ligand or mixture of ligands. 

10 [1 65] Monomer domains can be naturally-occmring or altered (non-natural 

variants). The term "naturally occurring'' is used herein to indicate that an object can be 
found in nature. For example, natural monomer domains can include human monomer 
domains or optionally, domains derived from different species or sources, e.g,, mammals, 
primates, rodents, fish, birds, reptiles, plants, etc. The natural occurring monomer domains 

15 can be obtained by a number of methods, e.g., by PGR amplification of genomic DNA or 
cDNA. 

[166] Monomer domains of the present invention can be naturally-occuiring 
domains or non-naturally occurring variants. Libraries of monomer domains employed in the 
practice of the present invention may contain nalnrally-occnrring monomer domam, non- 

20 naturally occurring monomer domain variants, or a combination thereof 

[167] Monomer domain variants caa include ancestral domains, chimeric 
domains, randomized domains, mutated domams, aad the like. For example, ancestral 
domains can be based on phylogenetic analysis. Chimeric domains are domains in which one 
or more regions are replaced by corresponding regions from other domains of the same 

25 family. For example, chimeric domains can be constructed by combining loop sequences 
from mnltiple related domains of the same family to form novel domains with potentially 
lowered immunogenicity . Those of skill in the art will recognized the immnnologic benefit 
of constructing modified binding domain monomers by combining loop regions from various 
related domains of the same family rather than creating random amino acid sequences. For 

30 example, by constructing variant domains by combining loop sequences or even multiple 
loop sequences that occm- naturally in human LDL receptor class A-domains, the resulting 
domains may contain novel binding properties but may not contain any immunogenic protein 
sequences because all of the exposed loops are of human origin. The combining of loop 
amino acid sequences in endogenous context can be applied to all of the monomer constructs 
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of the invention. Thus the present invention provides a method for generating a library of 
chimeric monomer domains derived from human proteins^ the method comprising: providing 
bop sequences corresponding to at least one loop from each of at least two different naturally 
occurring variants of a human protein, wherein the loop sequences are polynucleotide or 
polypeptide sequences; and covalently combining loop sequences to generate a library of at 
least two different chimeric sequences^ wherein each chimeric sequence encodes a chimeric 
monomer domain having at least two loops. Typically, the chimeric domain has at least four 
loopSj and usually at least six loops. As described above, the present invention provides three 
types of loops that are identified by specific features, such as, potential for disulfide bonding, 
bridging between secondary protein structures, and molecular dynamics (i.e., flexibility). 
The three types of loop sequences are a cysteine-deflned loop sequence, a structure-defined 
loop sequence^ and a B-factor-defined loop sequence. 

[1681 Randomized domains are domains in which one or more re^ons are 
randomized. The randomization can be based on foil randomizationj or optionally, partial 
randomization based on natural distribution of sequence diversity. 

[169] The present invention also provides recombinant nucleic acids 
encoding one or more polypeptides comprising one or aplm-ality of monomer domains that 
bind c-MBT. For example, the polypeptide can be selected to comprise a non-naturally 
occuring domain from the group consisting of: an EGF-like domain, a Kringle-domain, a 
fibronectin type I domain, a fibronectin type II domain^ a fibronectin type HI domain, a PAN 
domain, a Gla domain, a SRCR domain, a Kunitz/Bovine pancreatic trypsin Inhibitor domain, 
a Kazal-type serine protease inhibitor domain, a Trefoil (P-type) domain, a von Willebrand 
factor type C domain^ an Anaphylatoxin-like domain, a CtJB domain, a thyxoglobulin type I 
repeatj L0L-receptor class A domain, a Sushi domain, a Link domain, a Thrombospondin 
type I domain, an hnmunoglobulin4ike domain, a C-type lectin domain, a MAM domain, a 
von Willebrand factor type A domain, a Somatomedin B domain, a WAP4ype four disulfide 
core domain, a F5/8 type C domain, a Hemopexin domain, an SH2 domain, an SH3 domain, 
a Laminin-type EGF-like domain, a C2 domain and variants of one or more thereof In 
another embodiment, the naturally occuring polypeptide encodes a monomer domain found in 
the Pfam database and/or the SMART database. 

[170] All the compositions of the present invention, includmg the 
compositions produced by the methods of the present invention, e.g., monomer domains 
and/or immuno-domains, as well as multimers and libraries thereof can be optionally bound 
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to a matrix of an affimty material ExaiTiples of affinity tnaterial ixicl-ude beads, a colimn, a 
solid support, a miofoarray, oilier pools of reagent-supports, and the like. 



IIL MVIIJMERS 

[171] Methods for generating mxjltimers are a feature of the present 
invention. Multimers comprise at least two monomer domains. For example, xnultimers of 
the invention can comprise fi'om 2 to about 10 monomer domains, from 2 and about 8 
monomer domains, from about 3 and about 10 monomer domains, about 7 monomer 
domains, about 6 monomer domains, about 5 monomer dommns, or about 4 monomer 
domains. In some embodiments, the multimer comprises 3 or at least 3 monomer domains. 
In some embodiments, the mutimers have no more than 2, 3, 4, 5, 6, 7, or 8 monomer 
domains. In view of the possible range of monomer domain sizes, the multimers of the 
invention may be, e.g., less than 100 kD, less than 90kD, less than 80kD, less than 70kD, less 
than 60kD, less than 50kd, less than 40kD, less than 30kD, less than 251cD, less than ZOlcD, 
less than 15kD, less than lOkD or may be smaller or larger. In some cases, the monomer 
domains have been pre-selected for binding to the target molecule of interest (e,g,. Met). 

[172] In some embodhnentSj each monomer domain specifically binds to one 
target molecule (e.g., c-Met). In some of these embodiments, each monomer binds to a 
dilRferent position (analogous to an epitope) on a target molecule. Multiple monomer domains 
that bind to the same target molecule results m an avidity effect resulting in improved affinity 
of the multimer for the target molecule compared to the affinity of each individual monomer 
In some embodiments, the multimer has an avidity of at least about L5, 2, 3, 4, 5, 10, 20, 50, 
100, 200, 500, or 1000 times the avidity of a monomer domain alone. In some embodiments, 
at least one, two, three, four or more (e.g., all) monomers of a multimer bind an ion such as 
calcium or another ion. Multimers can comprise a variety of combinations of monomer 
domains. For example, in a single multimer, the selected monomer domains can be identical 
or different. In addition, the selected monomer domains can comprise various different 
monomer domains from the same monomer domain family, or various monomer domains 
from different domain families , or optionally, a combination of both. For example, the 
monomer domains maybe selected from Families MO of c-Met binding monomer domains. 
In some embodiments, at least one of the monomer domains is selected from from Family 10 
of the C"Met binding monomer domains. Exemplary c-MET-binding dhners (comprised of 
two C"MET-binding monomers) are listed in the examples. 
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[173] Multiraers that are generated in the practice of the present inventioii 
may be any of the following: 

(1) A homo-multimer (amxiltimer of the same domain, i.e., Al-Al-Al-Al); 

(2) A hetero-multimer of different domains of the seone domain class, e.g., A1-A2~A3- 
A4. For example, hetero-multimer include multimers where Al , A2, A3 and A4 are different 
non-nalTirally occuning variants of a particular LDL-receptor class A domains, or where 
some of Al , A2, A3, and A4 are naturally-occurring variants of a LDL-receptor class A 
domain. 

(3) A hetero-mxtltimer of domains &om different monomer domain classes, e.g., Al- 
B2-A2-B1 . For exMnple, where Al and A2 are two different monomer domains (either 
naturally occurring or non-n^ally-occurring) from LDL-receptor class A, and Bl and B2 
are two different monomer domains (either naturally occurring or non-naturally occurring) 
from class EGF-like domain). 

1174] In another embodiment, the multimer comprises monomer domains 
with specificities for different target molecules ie.g., a blood factor such as serum albumin, 
immunoglobulin, or erythrocytes). For example, in some embodiments, the raultimers of the 
invention comprises 1, 2, 3, or more monomer domains that bind to Met and at least one 
monomer domain that binds to a second target molecule. Exemplary target molecules 
include, e.g., a serum molecule that extends the serum half-life of the multimer {e.g., an 
immunoglobulin or serum albumin), EGFR gene family members, VEGF receptors, PDGF 
receptor, other receptor tyrosine kinases, integrins, other molecules implicated in 
tumorigenesis, or markers of tnmor tissue. Exemplary molecule that extends the serum half- 
life of a multimer include, e.g., red blood cells {i.e., erythrocytes), IgG, and serum albumin 
such as HSA. An exemplary multimer will mclude a monomer domain from Family 10 of 
the c-MET binding monomer domains and monomer domain from Family 2 or 3 of the 
immunoglobulin binding monomer domains. 

[175] Multimer libraries employed in the practice of the present invention 
may contain homo-multimere, hetero-multimers of different monomer domains (natural or 
non-natural) of the same monomer class, or hetero-multimers of monomer domains (natural 
or non-natural) from different monomer classes, or combinations thereof 

[176] Monomer domains, as described herein, are also readily employed in a 
immuno-domain-containing heteromultimer (i.e., a multimer that has at least one iramuno- 
domain variant and one monomer domain variant). Thus, multimers of the present invention 
may have at least one immuno-domain such as a minibody, a single-domain antibody, a 
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single chain variable fragment (ScFv)^ or a Fab fragment; md at least one monomer domain^ 
such as, for example, an EGF-like domain, a Kringle-domain, a fibronectin type I domain, a 
fibronectin type II domain, a fibronectin type III domain, a PAN domain, a Gla domain, a 
SRCR domain, a Kmiitz/Bovine pancreatic trypsin Inhibitor domain, a K^al-type serine 

5 protease inhibitor domain, a Trefoil (P-type) domain, a von Willebrand factor type C domain, 
an Anaphylatoxin-like domain, a CUB domain, a thyroglobxilin type I repeat, LDL-receptor 
class A domain, a Snshi domain, a Link domain, a Thrombospondin type I domain, an 
Immnnoglobulin-Iike domain, a C-type lectin domain, a MAM domain, a von Willebrand 
factor type A domain, a Somatomedin B domain, a WAP-type four disulfide core domain, a 

10 F5/8 type C domain, a Hemopexin domain, an SH2 domain, an SH3 domain, a Laminin-type 
EGF-Iike domain, a C2 domain, or variants thereof 

[1771| Domains need not be selected before the domains are linked to form 
multimers. On the other hand, the domains can be selected for the ability to bind to a target 
molecule before being linked into multimers. Thus, for example, a multjmer can comprise 

1 5 two domains that bind to one target molecule and a third domain that binds to a second target 
molecule. 

[178] The multimers of the present invention may have the following 
qualities: multivalent, multispecific, single chain, heat stable, extended serum and/or shelf 
half-life. Moreover, at least one, more than one or all of the monomer domains may bind an 

20 ion (e*g,, a metal ion or a calcium ion), at least one^ more than one or all monomer domains 
may be derived from LDL receptor A domains and/or EGF-like domains, at least one, more 
than one or all of the monomer domains may be non-naturally occurring, and/or at least one, 
more than one or all of the monomer domains may comprise 1 , 2, 3, or 4 disulfide bonds per 
monomer domain. In some embodiments, the multimers comprise at least two (or at least 

25 three) monomer domains, wherein at least one monomer domain is a non-naturally occurring 
monomer domain and the monomer domains bind calcium* In some embodiments, the 
multimers comprise at least 4 monomer domains, wherein at least one monomer domain is 
non-naturally occurring, and wherein: 

a. each monomer domain is between 304 00 amino acids and each of the monomer 
30 domains comprise at least one disulfide linkage; or 

b. each monomer domain is between 30-1 00 amino acids and is derived from an 
extracellular protein; or 

each monomer domam is between 30400 amino acids and binds to a protein target. 
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[179J In some embodiments^ Hie Tnultimers comprise at least 4 monomer 
domainsj wherein at least one monomer domain is non-naturally occurring, and wherein: 

a. each monomer domain is between 35-1 00 amino acids; or 

b. each domain comprises at least one disulfide bond and is derived jBrom a hmnan 
protein and/or an extracellxilar protein. 

[1801 In some embodiments, the multimers comprise at least two monomer 
domains, wherein at least one monomer domain is non-naturally occurring^ and wherein each 
domain is: 

a. 25-50 amino acids long and comprises at least one disulfide bond; or 

b. 25-50 amino adds long and is derived from an extracellular protein; or 

c. 25-50 amino adds and binds to a protein target; or 

d. 35-50 amino acids long. 

[181] In some embodiments^ the mnltimers comprise at least two monomer 
domains, wherein at least one monomer domain is non-naturally-occnrring and: 
a each monomer domain comprises at least one disulfide bond; or 

b. at least one monomer domain is derived from an extracellular protein; or 

c. at least one monomer domain binds to a target protein. 

{182] The monomer domains and/or multimers identified can have 
biological activity, which is meant to indnde at least specific binding affinity for a seleded or 
desired Uganda and, in some instances, will farther include the ability to block the binding of 
other compounds, to stimulate or inhibit metabolic pathways, to act as a signal or messenger, 
to stimulate or inhibit cellular activity, and the like. Monomer domains can be generated to 
function as ligands for receptors where the natural hgand for the receptor has not yet been 
identified (orphan receptors). These orphan ligands can be created to either block or activate 
the receptor top which they bind, 

[183] A single ligand can be used, or optionally a variety of Hgands can be 
used to select the monomer domains and/or multimers. A monomer domain of the present 
invention can bind a single ligand or a variety of ligands. A multimer of the present 
invention can have multiple discrete binding sites for a single ligand, or optionally, can have 
multiple binding sites for a variety of hgands. 

[184] hi some embodiments, the multimer comprises monomer domains with 
specificities for different proteins. The different proteins can be related or unrelated. 
Examples of related proteins including members of a protein family or different serotypes of 
a virus. Alternatively, the monomer domains of a multimer can target different molecules in 
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a physiolo^cal pathway (e,g,, different Wood coagulation proteins). In yet other 
embodiments^ monomer domains bind to proteins in muelated pathways (ag.j two domains 
bind to blood factors, two other domains bind to inflammation-related proteins and a fifth 
binds to serum albimin). In another embodiment, a multimer is comprised of monomer 
domains that bind to different pathogens or contaminants of interest. Such multimers are 
useful as a single detection agent capable of detecting for the possibility of any of a number 
of pathogens or contaminants. 

[185] In some embodiments, the mnltimers of the invention bind to the same 
or other muHimers to form aggregates. Aggregation can be mediated^ for example, by the 
presence of hydrophobic domains on two monomer domains, resulting in the formation of 
non-covalent interactions between two monomer domains. Alternatively, aggregation may be 
facilitated by one or more monomer domains in a multimer having binding specificity for a 
monomer domain in another multimer. Aggregates can also fom due to the presence of 
affinity peptides on the monomer domains or multimers. Aggregates can contain more target 
molecule binding domains than a single multimer. 

1186] Multimers with affinity for both a cell surface target and a second 
target may provide for increased avidity effects. In some cases^ membrane fluidity can be 
more flexible than protein linkers in optimizing (by self-assembly) the spacing and valency of 
the interactions. In some cases^ multimers will bind to two different targets, each on a 
different cell or one on a cell and another on a molecule with multiple binding sites, 

[187] hi some embodiments, the monomers or multimers of the present 
invention are linked to another polypeptide to form a fusion protein. Any polypeptide in ttie 
art may be used as a fusion partner, though it can be useM if the fusion partner forms 
multimers. For example, monomers or multimers of the invention may, for example, be 
fused to the following locations or combinations of locations of an antibody: 

1 . At the N-terminus of the VH 1 and/or VLi domains, optionally just after the 
leader peptide and before the domain starts (framework region 1 ); 

2. At the N-terminus of the CHI or CH domain, replacing the VHl or VLI 
domain; 

3. At the N-terminus of the heavy chain, optionally after the CHI domain and 
before the cysteine residues in the hinge (Fc™fiision); 

4. At the N-terminus of the CHS domain; 

5. At the C-terminus of the CH3 domain, optionally attached to the last amino 
acid residue via a short linker; 
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6. At the C"terminus of the CH2 domain, replacing the CHS domam; 

7. At the C-temiirms of the CLI or CHI domain, optionally after the cysteine 
that forms the interchain disulfide; or 

8. At the C-terminiis of the VHl or VLl domain, See^ e.g., Fi^re 7. 

5 [188] In some embodimentSj one or more monomer or mxiltimer domains of 

flie invention is linked to a molecule (e.g., a protein, nucleic acid^ organic small molecule^ 
etc.) useful as a pharmaceutical. Exemplary pharmaceutical proteins include, e,g,j cytokines, 
antibodies, chemokineSj growth factorSj interleukins^ cell-surface proteins^ extracellular 
domains, cell surface receptors, cytotoxins, etc. Exemplary small molecule pharmaceuticals 

1 0 include toxins or therapeutic agents, hi some embodiments, a metal can be bound to the 
polypeptides of the invention. This can be useful^ e,g,, as a contrast agent, e.g., for MRL 

[189) In some embodiments^ the monomer or multimers are selected to bind 
to a tissue- or disease-specijSc tai^get protein. Tissue-specific proteins are proteins tliat are 
expressed exclusively, or at a significantly higher level, in one or several particular tissue(s) 

15 compared to other tissues in an animal. As c-MET is expressed at significant levels in the 
liver^ monomer domains that bind to Met may be used to target other molecules, including 
other monomer domains^ to the liver. Tliis maybe used to target liver-specific diseases;, for 
example, by targeting therapeutic or toxic molecules to the liver. An example of a liver 
disease that can be treated is hepatocellular carcinoma. Similarlyj disease-specific proteins 

20 are proteins that are expressed exclusively, or at a significantly higher level, in one or several 
diseased cells or tissues compared to oth^ non-diseased cells or tissues in an animal. 

[190] In some embodiments, the monomers or multimers that bind to tlie 
target protein are linked to the pharmaceutical protein or small molecule such that the 
resulting complex or fusion is targeted to the specific tissue or disease-related cell(s) where 

25 the target protein (e.g., c-MBT) is expressed. Monomers or multimers for use in such 

complexes or fusions can be initially selected for binding to the target protein and may be 
subsequently selected by negative selection against other cells or tissue (e,g.j to avoid 
targeting bone marrow or other tissues that set the lower limit of drug toxicity) where it is 
desired that binding be reduced or eliminated in other non-target cells or tissues. By keeping 

30 the pharmaceutical away from sensitive tissues, the therapeutic window is increased so that a 
higher dose may be administered safely. In another alternative^ in vivo panning can be 
performed in animals by injecting a library of monomers or multinfiers into an animal and 
then isolating the monomers or multimers that bind to a particular tissue or cell of interest. 
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[191J The fosioB proteins described above may also include a linker peptide 
between the pharmaceutical protein and the tnonomer or multimers. A peptide linker 
seqiience may be employed to separate, for example, the polypeptide components by a 
distance snfficierxt to ensure that each polypeptide folds into its secondary and tertiary 

5 structures. Fusion proteins may generally be prepared using standard techniques, including 
chemical conjugation. Fusion proteins can also be expressed as recombinant proteins in an 
expression system by standard techniques. 

[192] Multimers or monomer domains of the invention can be produced 
according to any methods known in the art. In some embodimentSj E, coli comprising a pET™ 

10 derived plasmid encoding the polypeptides are induced to express the protein. After 

harvesting the bacteria, they may be lysed and clarified by centrifugation. The polypeptides 
may be purified using Ni-NTA agarose elution and refolded by dialysis. Misfolded proteins 
may be neutralized by capping free sulfiiydrils with iodoacetic acid, Q sepharose elution^ 
butyl sepharose flow-through, SF sepharose elution^ DEAE sepharose elution, and/or CM 

1 5 sepharose elution may be used to purify flie polypeptides. Equivalent anion and/or cation 
exchange purification steps may also be employed. 

[193| In some embodiments^ the polypeptide comprising a monomer or 
multimer of the invention is linked to itself (C-terminus to N-terminus), e.g.^ for proteion 
stability, 

20 

/K LINKERS 

[194] Monomer domains can be joined by a hnker to form a multimer. For 
example^ a linlcer may be positioned between each separate discrete monomer domain in a 
multimer. 

25 [195] Joining the selected monomer domains via a linker can bo 

accomplished using a variety of techniques known in the art. For example^ combinatorial 
assembly of polynucleotides encoding selected monomer domains can be achieved by 
restriction digestion and re-ligation, by PCR-based, self-priming overlap reactions, or other 
recombinant methods. The Hnker can be attached to a monomer before the monomer is 

30 identified for its ability to bind to a target multimer or after the monomer has been selected 
for the ability to bind to a target multimer. 

[196] The linker can be naturally-occurring, synthetic or a combination of 
both. For example, the synthetic linker can be a randomized Hnker^ e.g,, both in sequence 
and size, to one aspect^ the randomized linker can comprise a fully randomized sequence^ or 
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optionally, the randomized linker can be based on natoal linker sequences. The linker can 
comprise^ e.g^. a non-polypeptide moiety, a polynucleotide^ a polypeptide orthelika 

[197| Alinkercanberigid, or flexible, or a combination of both. Linker 
flexibility can be a function of the composition of both the linker and the monomer domains 

5 that the linker interacts with. The linlcer joins two selected monomer domain^ and maintains 
the monomer domains as separate discrete monomer domains. The linker can allow the 
separate discrete monomer domains to cooperate yet maintain separate properties such as 
multiple separate binding sites for the same ligand in a multimerj or e.g,, multiple separate 
binding sites for different ligands in a multimer. 

10 [198] Choosing a suitable linker for a specific case where two or more 

monomer domains (i.e, polypeptide chains) are to be connected may depend on a variety of 
parameters including^ e.g. the nature of the monomer domains, the structure and nature of the 
target to which the polypeptide multimer should bind and/or the stability of the peptide linker 
towards proteolysis and oxidation. 

1 5 [199] The present invention provides methods for optimizing the choice of 

linker once the desired monomer domains/variants have been identified. Generallys libraries 
of multimers having a composition that is fixed with regard to monomer domain composition^ 
but variable in linker composition and lengthy can be readily prepared and screened as 
described above. 

20 [200] A more detailed discussion of linkers can be found in^ e.g., U.S. Patent 

Publication No. 2005/0048512. 

K IDENTIFYING MONOMERS OR MULTIMESS WITH AFFINITY FOR 

A TARGET MOLECULE 

25 [201] Those of skill in the art cm readily identify monomer domains with a 

desired property (e,g,, binding affinity). For those embodiments, any method resulting in 
selection of domains mill a desired property (e.g., a specific binding property) can be used. 
For example^ the methods can comprise providing a plurality of different nucleic acids, each 
nucleic acid encoding a monomer domain; translating the plurality of different nucleic acids^ 

30 thereby providing a plurality of different monomer domains; screening the plurality of 
different monomer domains for binding of the desired ligand or a mixture of ligands; and, 
identifying members of the plurality of different monomer domains that bind the desired 
ligand or mixture of Ugands. 
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{202] In addition, any method of mutagenesis, such as site-directed 
mutagenesis and random mutagenesis (e.g., chemical mutagenesis) can be used to produce 
monomer domains, e.g., for a monomer domain library, fa some embodiments, error-prone 
PGR is employed to create variants. Additional methods include aligning a plurality of 
naturally occuning monomer domains by aligning conserved amino acids in the plurality of 
naturally occuxring monomer domains; and, designing the non-nalOTally occurring monomer 
domain by maintaining the conserved amino acids and inserting^ deleting or altering amino 
acids around the conserved amino acids to generate the non-naturally occurring monomer 
domain. In one embodiment, the conserved amino acids comprise cysteines. In another 
embodiment, the inserting step uses random amino acids, or optionally, the inserting step uses 
portions of the naturally occurring monomer domains. The portions could ideally encode 
loops from domains from the same family. Amino acids are inserted or exchanged using 
synthetic oligonucleotides^ or by shuffling, or by restriction enzyme based recombination. 
Human chimeric domains of the present invention are useM for therapeutic applications 
where minimal immunogenicity is desired. The present invention provides methods for 
generating libraries of human chimeric domains. Human chimeric monomer domain 
libraries can be constructed by combining loop sequences from different variants of a human 
monomer domain, as described above. The loop sequences that are combined may be 
sequence-defined loops, structure-defined loops, B-factor-deflned loops^ or a combination of 
any two or more thereof. 

[203] Alternatively, a human chimeric domain library can be generated by 
modifying naturally-occurring human monomer domains at the amino acid level, as 
compared to the loop level In some embodiments, to minimize the potential for 
immunogenicity, only those residues that naturally occur- in protein sequences from the same 
family of human monomer domains are utilized to create the chimeric sequences. This can 
be achieved by providing a sequence alignment of at least two human monomer domains 
from the same family of monomer domains, identifying amino acid residues in corresponding 
positions in the human monomer domain sequences that differ between the human monomer 
domains, generating two or more human chimeric monomer domains, wherein each human 
chimeric monomer domain sequence consists of amino acid residues that correspond in type 
and position to residues from two or more human monomer domains from the same family of 
monomer domains. Libraries of human chimeric monomer domains can be employed to 
identify human chimeric monomer domains that bind to a tai'get of interest by: screening the 
library of human chimeric monomer domains for binding to a target molecule, and 
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identifying a human chimeric monomer domain that binds to the target molecule, Sintable 
naturally-occurring human monomer domain sequences employed in the initial sequence 
alignnaent step include those corresponding to any of the naturally-occurring monomer 
domains described herein. 
5 p04] Domains of hiiman monomer variant libraries of the present invention 

(whether generated by varying loops or single amino acid residues) can be prepared by 
methods known to those having ordinary skill in the art. Methods paiticularly suitable for 
generating these libraries are split-pool format and trinucleotide synthesis format as described 
in WOOl/23401- 

10 [205] hi some embodiments, monomer domains of the invention are screened 

for potential immnnogenicity by: 

providing a candidate protein sequence; 

comparing Ihe candidate protein sequence to a database of human protein sequences; 
identifying portions of the candidate protein sequence that correspond to portions of 
1 5 human protein sequences from the database; and 

determining the extent of correspondence between the candidate protein sequence and 
the hxmian protein sequences from the database. 

[2061 fa generalj the greater the extent of correspondence between the 
candidate protein sequence and one or more of the human protein sequences from the 
20 database, the lower the potential for immunogemcity is predicted as compared to a candidate 
protein having little correspondence with any of the human protein sequences from the 
database. A database of human protein sequences that is suitable for use in the practice of the 
invention method for screening candidate proteins can be found at 

ncbi,nIm,mh.gov/blast/Blast.cgi at the World Wide Web (in addition^ flie following web site 
25 can be used to search short, nearly exact matches: cbi.nlm.nih.gov^last/Blast.cgi?CMD 

=Web&LAYOUT-TwoWindows&AUTOJORMAT-Semiauto&ALIGNMENTS-50&ALI 
GNMENT__VIEW-Pairwise&CLIENT-^eb&DATABASE-^&DESCMPTIONS-100& 
TREZ_QUERY=(none)&EXPECT-1000&FORMAT_OBJECT-AligDment&FO^^ 
PE-HTML&NCBI_GI^on&PAGE-Nucleotides&PROGRAM-blastn&SERVICE^^ 
30 ET^DEFAULTS.x-29&SET_DEFAULTS.y-6&SHOW_OVERVmW-on&WORr>_SIZE= 
7&END^0FJiTTPGET-Yes&SH0W^LINK0UT-yes at the Worid Wide Web). The 
method is particularly useftil in deteimining whether a crossover sequence in a chimeric 
protein^ such as, for example^ a chimeric monomer domain^ is likely to cause an 
immunogenic event. If the crossover sequence corresponds to a portion of a sequence found 
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in the database of human protein sequences, it is believed that the crossover sequence is less 
likely to cause an inHnnnogemc event. 

[207] Infonnation pertaining to portions of hrnnan protein sequences from 
the database can be used to design a protein library of human-like chimeric proteins. Such 
library can be generated by using information pertaining to ^'crossover sequence" that exist 
in naturally occurring human proteins. The term ''crossover sequence" refers herein to a 
sequence that is found in its entirety in at least one naturally occurring human protein, in 
which portions of the sequence are found in two or more naturally occurring proteins. Thus, 
recombination of the latter two or more naturally occurring proteins would generate a 
chimeric protein in which the chimeric portion of the sequence actually corresponds to a 
sequence found in another naturally occurring protein. The crossover sequence contains a 
chimeric junction o:^ two consecutive amino acid residue positions in which the first amino 
acid position is occupied by an amino acid residue identical in type and position found in a 
first and second naturally occurring human protein sequence^ but not a third naturally 
occurring human protein sequence. The second amino acid position is occupied by m amino 
acid residue identical in type and position found in a second and third naturally occurring 
human protein sequence, but not the first naturally occurring human protein sequence. In 
other words, the "second'' naturally occurring human protein sequence corresponds to the 
naturally occurring human protein in which the crossover sequence appears in its entirety, as 
described above. 

[208] In some embodiments, a library of human-like chimeric proteins is 
generated by: identifying human protein sequences from a database that correspond to 
proteins fi^om the same family of proteins; aligning the human protein sequences from the 
same family of proteins to a reference protein sequence; identifying a set of subsequences 
derived fi'om different human protein sequences of the same family, wherein each 
subsequence shares a region of identity with at least one other subsequence derived from a 
different naturally occurring human protein sequence; identifying a cWmeric junction from a 
first, a second, and a third subsequence, wherein each subsequence is derived from a different 
natui-ally occurring human protein sequence, and wherein the chimeric junction comprises 
two consecutive amino acid residue positions in which the first amino acid position is 
occupied by an amino acid residue common to the first and second naturally occurring human 
protein sequence, but not the third naturally occuning human protein sequence, and the 
second amino acid position is occupied by an amino acid residue common to the second and 
third naturally occurring human protein sequence, and generating human-like chimeric 
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protein molectiles each conesponding in sequence to two or more subsequences from the set 
of subsequences, and each comprising one of more of the identified chimeric junctions, 
[209] Thus, for example, if the first naturally-occurring human protein 
sequence is, A-B-C, and the second is, B^C-D-B, and the third is, D-E-F, then the chimeric 
5 junction is C-D, Alternatively, if the first naturally-occuning human protein sequence is 
E-F-G, and the second is B-C-D-^E-F, and the third is A-B-C^-D, then the chimeric junction is 
D"E. Human-like chimeric protein molecules can be generated in a variety of ways. For 
example, oligonucleotides comprising sequences encoding the chimeric junctions can be 
recombined with oligonucleotides corresponding in sequence to two or more subsequences 
1 0 from the above-described set of subsequences to generate a human-like chimeric protein, and 
libraries thereof. The reference sequence used to align the naturally occurring human 
proteins is a sequence from the same family of naturally occurring human proteins, or a 
chimera or other variant of proteins m the family. 

[210] Nucleic acids encoding fragments of naturally-occuning monomer 
1 5 domains can also be mixed and/or recombined (e.g., by using chemically or enzymatically- 
produced fi^agments) to generate full-length, modified monomer domains. The frag^ients and 
the monomer domain can also be recombined by manipulating nucleic acids encoding 
domains or fiagraents thereof. For example, ligating a nucleic acid construct encoding 
fragments of the monomer domain can be used to generate an altered monomer domain. 
20 [211} Altered monomer domains can also be generated by providing a 

collection of synthetic oligonucleotides (e.g., overlapping oligonucleotides) encoding 
conserved, random, pseudorandom, or a defined sequence of peptide sequences that are then 
inserted by Hgation into a predetermined site in a polynucleotide encoding a monomer 
domain, Similariy, the sequence diversity of one or more monomer domains can be 
25 expanded by mutating the monomer domain(s) with site-directed mutagenesis, random 

mutation, pseudorandom mutation, defined kemal mutation, codon-based mutation, and the 
like. The resultant nucleic acid molecules can be propagated in a host for cloning and 
amplification. In some embodiments, the nucleic acids are shuffled. 

[212] The present invention also provides a method for recombining a 
3 0 plurality of nucleic acids encoding monomer domains and screening the resulting library for 
monomer domains that bind to the desired ligand or mixture of ligands or the like. Selected 
monomer doiuain nucleic acids can also be back-crossed by shuffling with polynucleotide 
sequences encoding neutral sequences (i.e., having insubstantial functional effect on bmding), 
such as for example, by back-crossing with a wild-type or naturally-occurring sequence 
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suljstafltially identical to a selected sequence to produce native-like functional monomer 
domains. Generally, during back-crossing, subsequent selection is applied to retain the 
property, e.g., binding to the ligand. 

[213] In some embodiments, the monomer library is prepared by shuffling. 
5 In such a case, monomer domains are isolated and shuffled to combinatorially recombine the 
nucleic acid sequences that encode the monomer domains (recombination can occur between 
or within monomer domains, or both). The first step involves identifying a monomer domain 
having the desired property, e.g., affinity for a certain ligand. While maintaining the 
conserved amino adds during the recombination, the nucleic acid sequences encoding the 
10 monomer domams can be recombined, or recombined and johied into multuners. 

[2141 A significant advantage of the present invention is that known ligands, 
or unknown Hgands can be used to select the mcnomer domains and/or multimesrs. No prior 
information regarding ligand structure is required to isolate the monomer domains of interest 
or the multimers of interest. The monomer domains and/or multimers identified can have 
1 5 biological activity, which is meant to include at least specific binding ai&iity for a selected or 
desired Ugand, and, in some instances, will fiirther include the ability to block the bindmg of 
other compounds, to stimulate or inhibit metabolic pa&ways, to act as a signal or messenger, 
to stimulate or inhibit cellular activity, and the like. Monomer domains can be generated to 
function as ligands for receptors where the natural ligand for the receptor has not yet been 
20 identified (orphan receptors). These orphan ligands can be created to either block or activate 
the receptor to which they bind. 

[215] A single ligand can be used, or optionally a variety of ligands can be 
used to select the monomer domains and/or multimers. A monomer domain of the present 
invention can bind a single ligand or a variety of ligands. A multimer of the present 
25 invention can have multiple discrete binding sites for a single ligand, or optionally, can have 
multiple binding sites for a variety of ligands. 

[216] The invention also includes compositions that are produced by 
methods of the present invention. For example, the present invention includes monomer 
domains selected or identified fiom a library and/or libraries comprising monomer domains 
30 produced by the methods of the present invention. 

[217] The present invention also provides libraries of monomer domains and 
libraries of nucleic adds that encode monomer domains. The libraries can include, e.g., 
about 100, 250, 500 or more nucleic adds encoding monomer domains, or tiie library can 
include, e.g., about 100, 250, 500 or more polypeptides that encode monomer domains. 
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Libraries can incMe monomer domains containing the same cysteine frame, e.g., A-domains 
or EGF-like domains. 

[218] In some embodiments, variants are generated by recombining two or 
more different seqiiences from the same family of monomer domains (ag,, the LDL receptor 
class A domain). Alternatively, two or more different monomer domains from different 
families can be combined to form a mnltimer. In some embodiments, the multiraeis are ^ 
formed from monomers or monomer variants of at least one of the following family classes: 
an EGF-like domain, a Kringle-domain, a jSbronectin type I domain, a fibronectin type 11 
domain, a fibronectin type III domain, a PAN domain, a Gla domain, a SRCR domain, a 
K-anitz/Bovine pancreatic trypsin Inhibitor domain, a Kazal-type serine protease inhibitor 
domain, a Trefoil (P-type) domain, a von Willebrand factor type C domain, an 
Anaphylatoxin-like domain, a CUB domain, a thyroglobulin t;^e I repeat, LDL-receptor 
class A domain, a Sushi domain, a Lmk domain, a Thrombospoudin type I domain, an ^ 
hmBunoglobnlin-lilce domain, a C-type lectin domain, a MAM domain, a von Willebrand 
factor type A domain, a Somatomedin B domain, a WAP-type fonr disulfide core domain, a 
FS/S type C domain, a Hemopexin domain, an SH2 domain, an SH3 domain, a Laminin-type 
EGF-like domain, a C2 domain and derivatives thereof. In another embodiment, the 
monomer domain and the different monomer domain can include one or more domains found 
in the Pfam database and/or the SMART database. Libraries produced by the methods above, 
one or more cell(s) comprising one or more members of the library, and one or more displays 
comprising one or more members of the library are also included in the present invention, 

(219) Optionally, a data set of nncleic acid character strings encoding 
monomer domains can be generated e.g., by mixing a first character string encoding a 
monomer domain, with one or more character string encoding a different monomer domain, 
thereby producing a data set of nucleic acids character strings encoding monomer domains, 
including those described herein. In another embodiment, the monomer domain and the 
different monomer domain can include one or more domains found in the Pfam database 
and/or the SMART database. The methods can fijrflier comprise inserting the first character 
string encoding the monomer domain and the one or more second character string encoding 
the different monomer domain in a computer and generating a multimer chai'acter string(s) or 
library(s), thereof in the computer, 

(220J The libraries can be screened for a desired property such as binding of 
a desired ligand or mixture of ligands. For example, members of the library of monomer 
domains caabe displayed and prescreened for binding to a Icnown or unknown hgand or a 
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mixture of Hgands. The monomer domain s^queuces can then be mutagenized {e.g.^ 
recombtned, chemically altered, etc.) or otherwise altered and the new monomer domains can 
be screened again for binding to the ligand or the mixture of ligands with an improved 
affinity. The selected monomer domains can be combined or joined to form mnltimers, 

5 which can then be screened for an improved affmity or avidity or altered specificity for the 
ligand or the mixture of ligands. Altered specificity can mean that the specificity is 
broadened, e.g.j binding of mnltiple related viruses, or optionally, altered specificity can 
mean that the specificity is narrowed, e.g., binding within a specific region of a ligand. 
Those of skill in the art will recognize that there are a number of methods available to 

10 calculate avidity. See, e.g., Mammen et al, Angew Chem Int. Ed. 37:2754-2794 (1998); 
Muller et al, Anal Biochem. 261 :149-158 (1998). 

VT. SELECTION OF MONOMER DOMAINS THAT BIND c-MET 

[221] Preliminary screens can be conducted by screening for agents capable 

15 of binding to c-MET, as at least some of the agents so identified are likely o-MET modulators 
(e.g., antagonists or agonists). The binding assays usually involve contacting a c-MET 
protein (or a fi:agment thereof such as a firagment comprising the SEMA rdomain or the a 
chain) with one or more test agents (i.e., monomers or multhners of the invention) and 
allowing sufficient time for the protein and test agents to form a binding complex. Any 

20 binding complexes formed can be detected using any of a number of established analytical 
techniques. Protein binding assays include, but are not limited to, immunohistochemical 
binding assays, flow cytometry or other assays. The c-MET protein utilized in such assays 
can be naturally expressed, cloned or synthesized. Similar methods can be used to identify 
monomer domains or multimers that bind IgG. 

25 [222] The screening methods of the invention can be performed as in vitro or 

cell-based assays. Cell based assays can be performed in any cells in which c-MET is 
expressed. Cell-based assays may involve whole cells or cell fi-actions containing a c-MET 
receptor to screen for agent binding or modulation of activity of c-MET by the agent 
Exemplary cell types that can be used according to the methods of the invention include^ e.g., 

30 any mammalian cells, as well as fungal cells^ including yeast, and bacterial cells. Cells can 
be primary cells or tumor cells or other types of immortal cell lines. Of course, c-MET can 
be expressed in cells that do not cndogenously contain c-MET. 
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[223] c-MET activity assays may also be used to identify a modulator 
(antagonist or agonist) of c-MET. In these embodiments, one or more test agents are 
contacted to a cell expressing c-MET and then tested for an activity of c-MET. Exemplary 
MET activities include HGF-dependent or constitutive kinase activity. See^ e.g., Christensen 
5 et al. Cancer Res. 63:7345-7355 (2003), In other embodiments, down stream molecular 
events can also be monitored to determine signaling activity. For example, o-MET induces 
cell growth (proliferation and survival), cell motility, invasion and morphology changes. In 
addition, c-MET indirectly mediates phosphorylation of Gab-^l, Akt, signal transducer and 
activator of transcription 3, phosphoUpase C y, and focal adhesions kinase, among others, 

1 0 See, e.g. , Christensen et al. Cancer Res. 63 :7345-7355 (2003). 

[224] In some embodiments^ activity assays are also used to confirm that 
identified antagonist monomers or multimers (i.e., that compete with HGF) lack agonist 
activity (i.e., that they do not activate c-MET in the absence of HGF or another agonist). 

[225] Agents that are initially identified by any of the foregoing screening 

1 5 methods can be further tested to validate the apparent activity. Such studies may be 
conducted with suitable animal models. The basic format of such methods involves 
administering a lead compound identified during an initial screen to an animal that serves as a 
mode! for humans and then detemiining if c-MET is in fact modulated and/or the disease or 
condition is ameliorated. The animal models utilized in validation studies generally are 

20 mammals of any kind. Specific examples of suitable animals include, but are not limited to, 
primateSj mice and rats. 

[226] Selection of monomer domains that bind c-MET fi-om a library of 
domains can be accomplished by a variety of procedures. For example, one method of 
identifying monomer domains which have a desired property (e.g,, binding c-MET or IgG) 

25 involves translating a plurality of nucleic acids, where each nucleic acid encodes a monomer 
domairij screening the polypeptides encoded by the plurality of nucleic acidSj and identifying 
those monomer domains that, e.g., bind to a desired Hgand or mixture of ligands, thereby 
producing a selected monomer domain. The monomer domains expressed by each of the 
nucleic acids can be tested for their ability to bind to the ligand by methods Icnown in the ait 

30 (i,e. panning, affinity chromatography, FACS analysis). 

[227] As mentioned above, selection of monomer domains can be based on 
binding to a ligand such as c^MET, or a fi-agment therof or other target molecule (e.g., lipid, 
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carbohydrate, nucleic acid and the like). Other molecxiles can optionally be included in the 
methods along with the target, e.g., ions such as Ca"^^. 

[228J When a monomer domain of the invention is selected based on its 
ability to bind to a Ugand, the selection basis can include selection based on a slow 
5 dissociation rate, which is usually predictive of hi^ affinily. The valency of the ligand can 
also be varied to control the average binding afSnity of selected monomer domains. The 
ligand can be bound to a surface or substrate at varying densities, such as by including a 
competitor compound, by dilution, or by other method known to those in the art. High 
density (valency) of predetemiined ligand can be used to eraich for monomer domains that 
10 have relatively low affinity, whereas a low density (valency) can preferentiaUy enrich for 
higher affinity monomer domains. 

[229] A variety of reporting display vectors or systems can be used to 
express nucleic adds encoding the monomer domains and/or multimers of the present 
invention and to test for a desired activity. For example, a phage display system is a system 
15 in which monomer domains are expressed as fvision protems on the phage surface 

(Pharmacia, Milwaukee Wis.). Phage display can involve the presentation of a polypeptide 
sequence encoding monomer domains on the surface of a filamentous bacteriophage, 
typically as a fusion with a bacteriophage coat protein. 

[230] Generally in these methods, each phage particle or cell serves as an 
20 individual library member displaying a single species of displayed polypeptide in addition to 
the natural phage or cell protein sequences. The nucleic acids are cloned mto the phage DNA 
at a site which results in the transcription of a fusion protein, a portion of which is encoded 
by the plurality of the nucleic adds. The phage containing a nuddc acid molecule undergoes 
replication and transcription in the cell. The leader sequence of the fusion protein directs the 
25 transport of the fusion protein to the tip of the phage partide. Thus, the fusion protein that is 
partially encoded by the nucleic add is displayed on the phage particle for detection and 
selection by the methods described above and below. For example, the phage library can be 
incubated with a predetermined hgand such as c-MET or a fragment thereof, so that phage 
partides which present a fusion protein sequence that binds to the Ugand can be differentially 
30 partitioned from those that do not present polypef tide sequences that bind to the 

predetermined ligand, For example, the separation can be provided by immobilizing the 
predetermined ligand. The phage partides (i.e., Ubrary members) which are bound io the 
immobilized Ugand are then recovered and repUcated to amplify the sdected phage 
subpopulation for a subsequent round of affinity enrichment and phage replication. After 
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several rounds of affinity enrichment and phage replication, the phage library members that 
are thus selected are isolated and the nucleotide sequence encoding the displayed polypeptide 
sequence is deterrainedj thereby identifying the sequence(s) of polypeptides that bind to the 
predetermined Hgand* Such methods ai'e further described in PCX patent publication Nos. 

5 91/17271,91/18980, and 91/19818 and 93/08278. 

[231] Examples of other display systems include ribosome displays, a 
nucleotide-linked display {see, e.g., U.S. Patent Nos. 6,281,344; 6,194,550, 6,207,446, 
6,214,553, and 6;,258,558), polysome display, cell surface displays and the lilce. The cell 
surface displays include a variety of cells, e.g., coH, yeast and/or mammalian cells* When 

10 a cell is used as a display, the nucleic acids, e.g., obtained by PGR amplification followed by ' 
digestion, are introduced into the cell and translated. Optionally, polypeptides encoding the 
monomer domains or the multimers of the present invention can be introduced, e.g., by 
injection, into the cell. 

[232] The monomer and multimer libraries of the invention can be screened 

15 for a desired property such as binding of a desired ligand (e.g., c-MET) or mixture of ligands. 
For example, members of the library of monomer domains can be displayed and prescreened 
for binding to a known or unknown ligand or a mixture of ligands. The monomer domain 
sequences can then be mutagenized(e*g*, recombined, chemically altered, etc.) or otherwise 
altered and the new monomer domains can be screened again for binding to the ligand or the 

20 mixture of ligands with an unproved affinity. The selected monomer domains can be 

combined or joined to fonn multimers, which can then be screened for an improved affinity 
or avidity or altered specificity for the ligand or the mixture of ligands. Altered specificity 
can mean that the specificity is broadened, e.g., binding of multiple related ligands, or 
optionally, altered specificity can mean that the specificity is narrowed, e.g., binding within a 

25 specific region of a ligand. Those of skill in the art will recognize that there are a number of 
methods available to calculate avidity. See^ e.g.^ Mammen et al, Angew Chem Int Ed. 
37:2754-2794 (1998); Muller et aU Anal Biochem. 261:149-158 (1998). 

{233] Those of skill in the art will recognize that the steps of generating 
variation and screening for a desired property can be repeated (i.e., performed recursively) to 

30 optimize results. For example, in a phage display library or other like fonnat, a first 

screening of a library can be performed at relatively lower stringency, thereby selected as 
many particles associated with a target molecule as possible. The selected particles can then 
be isolated and the polynucleotides encoding the monomer or multimer can be isolated fi'om 
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Hho particles. Additional variations can then bo generated from these sequences and 
subsequently screened at higher affinity. 

[234] All the compositions of flie present invention, e.g., monomer domains 
as well as multimers and libraries thereof can be optionally bomd to a matrix of an affinity 

5 material. Examples of affinity material include beads, a column^ a solid support, a 
microarrayj other pools of reagent-supports, and the like, 

(235) When multimers capable of binding relatively large targets are desired, 
they can be generated by a "walking" selection method. This method is carried out by 
providing a library of monomer domains and screening the library of monomer domains for 

1 0 affinity to a first target molecule. Once at least one monomer that binds to the target is 

identifiedj that monomer is covalently linked to a new library or each remaining member of 
the original library of monomer domains. This new library of multimers (dimers) is then 
screened for multimers that bind to the target with an increased affinity, and a multimer lhat 
binds to the target with an increased affinity can be identified. The "walking" monomer 

1 5 selection method provides a way to assemble a multimer that is composed of monomers that 
can act additively or even syner^stically with each other given the restraints of linker length. 
This walldng technique is very usefi^l when selecting for and assembling multimers that are 
able to bind large target proteins with hi^ affinity. The walking method cm be repeated to 
add more monomers thereby resulting in a multimer comprising 2, 3, 4, 5, 6, 7, 8 or more 

20 monomers linked together, 

[236] In some embodiments, the selected multimer comprises more than two 
domains. Such multimers can be generated in a step fashion, e.g., where the addition of each 
new domain is tested individually and the effect of the domains is tested in a sequential 
fashion. See, e.g., Figure 5. In an alternate embodiment, domains are liiiked to form 

25 multimers comprising more than two domains and selected for binding without prior 
knowledge of how smaller multhxiers, or altemativelyj how each domain, bind, 

[237] The methods of the present invention also include methods of evolving 
monomers or multimers. hatra-domain recombination can be introduced into monomers 
across the entire monomer or by taking portions of different monomers to form new 

30 recombined units. Interdomain recombination (e.g., recombining different monomers into or 
between multimers) or recombination of modules (e.g., multiple monomers within a 
multimer) maybe achieved. Inter-library recombination is also contemplated. 

[238| Methods for evolving monomers or multimers can comprise, e.g., any 
or all of the following steps: providing a plurality of different nucleic acids, where each 
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nucleic acid encoding a monomer domain; translating the pl-urality of different nucleic acids, 
which provides a plurality of different monomer domains; screening the plurality of different 
monomer domains for binding of the desired ligand (e.g., c-MET) or mixture of ligands; 
identifying members of the plurality of different monomer dom^ns that bind the desired 

5 ligand or mixture of ligands, which provides selected monomer domains; joining the selected 
monomer domains with at least one linker to generate at least one multimer, wherein the at 
least one multimer comprises at least two of the selected monomer domains and the at least 
one linker; and^ screening the at least one multimer for an improved affinity or avidity or 
altered specificity for the desired ligand or mixture of ligands as compared to the selected 

1 0 monomer domains , 

f239] Variation can be introduced into either monomers or multimers. An 
example of improving monomers? includes intra-domain recombination in which two or more 
{e.g,;^ three, four, five^ or more ) portions of (he monomer are amplified separately under 
conditions to introduce variation (for example by shuffling or other recombination method) in 

1 5 the resulting amplification products, thereby synthesizing a library of variants for different 
portions of the monomer. By locating the 5* ends of the middle primers in a "middle" or 
'overlap' sequence that both of the PGR fragments have in common, the resulting "left" side 
and "right" side libraries maybe combined by overlap PGR to generate novel variants of the 
original pool of monomers. These new variants may then be sci^eened for desired properties, 

20 e.g., panned against a target or screened for a fimctional effect. The "middle" primer(s) may 
be selected to correspond to any segment of the monomer, and will typically be based on the 
scaffold or one or more consensus amino acids within the monomer (e.g,, cysteines such as 
those found in A domains). 

[240) Similarly, multimers may be created by introducing variation at the 

25 monomer level and then recombining monomer variant libraries. On a larger scale, 
multimers (single or pools) with desired properties may be recombined to form longer 
multimers. In some cases variation is introduced (typically synthetically) into the monomers 
or into the linkers to form libraries. This may be achieved, e.g.^ with two different multimers 
that bind to two different targets, thereby eventually selecting a multimer with a portion that 

30 binds to one target and a portion that binds a second target 

[241] Additional variation can be introduced by inserting linkers of different 
length and composition between domains. This allows for the selection of optimal linkers 
between domains. In some embodiments, optimal length and composition of linkers will 
allow for optimal binding of domains. In some embodiments, the domains with a particular 
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binding affinity(s) are linked via different licJcers and optimal linkers are selected in a bindiag 
assay. For example, domains are selected for desired binding properties and then formed into 
a library comprising a variety of linkers. The library can then be screened to identify' optimal 
linkefB. Alternatively, mnltimer Ubraries can be formed where the effect of domain or linker 
on target molecule binding is not known. 

[242J Methods of the present invention ateo include generating one or more 
selected mnltimers by providing a plurality of monomer domains. The plurality of monomer 
domains is screened for binding of a desired hgand or mixture of ligands. Members of the 
plurality of domains that bind the desired Hgand or mixture of ligands are identified, thereby 
providing domains with a desired affinity. The identified domains are joined with at least one 
linker to generate the multimers, wherein each multimer comprises at least two of the 
selected domains and the at least one linker; and, the mvitimers are screened for an improved 
affinity or avidity or altered specificity for the desired ligand or mixture of ligands as 
compared to the selected domains, thereby identifying the one or more selected multimers, 

[243] Multimer libraries may be generated, in some embodiments, by 
combining two or more libraries or monomers or multimers in a recombmase-based 
approach, where each library member comprises as recombination site (e.g., a iox site). A 
larger pool of raolecularly diverse library members in principle harbor more variants with 
desired properties, such as higher target-binding affinities and fimctional activities. When 
libraries are constructed in phage vectors, which may be transformed into E. coli, library size 
(10^ - lO'") is limited by the transformation efficiency of j^. coli. A 
recombinase/recombination site system (e.g., the Cre-teP system) and in vivo recombination 
can be exploited to generate libraries that are not limited in size by the transformation 

efficiency of E. coli. 

[244] For example, the Cre-loxP system may be used to generate dimer 
libraries with 10^°, lO", 10^^ 10'^ or greater diversity. In some embodiments, E. coli as a 
host for one naive monomer library and a filamentous phage that carries a second naive 
monomer library are used. The library size in this case is limited only by the number of 
infective phage (carrying one library) and the number of infectible E. coli cells (carrying the 
other library). For example, infecting 10'=^ E. coli cells (IL at OD600=1) with >10^' phage 
could produce as many as 10^^ dimer combinations. 

[245] Selection of multimers can be accomplished using a variety of 
techniques including those mentioned above for identifying monomer domains. Other 
selection meihods include, e.g., a selection based on an improved affinity or avidity or altered 
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Specificity for the ligand compared to selected moBomer domains. For example, a selection 
can be based on selective binding to specific cell types, or to a set of related cells or protein 
types {e,g,s different virus serotypes). Optimization of the property selected for^ e.g,, avidity 
of a ligand, can then be achieved by recombioing the domains, as well as manipnlating amino 

5 acid sequence of the individual monomer domains or the linker domain or the nucleotide 
sequence encoding snch domains^ as mentioned in the present invention. 

[246] One method for identifying mnltimers can be accomplished by 
displaying the mnltimers. As with the monomer domains, the mnltimers are optionally 
expressed or displayed on a variety of display systemSj e.g^^ phage display^ ribosome display^ 

10 polysome display, nnoleotide-lihked display {see, e.g. , U,S, Patent Nos. 6^281 j344; 

6,194,550, 6,207,446, 6,214,553, and 6,258,558) and/or cell surface display, as described 
above. Cell snrface displays can inclnde bnt axe not limited to B, coh\ yeast or mammalian 
cells. In addition, display libraries of mnltimers v/ith multiple binding sites can be panned for 
avidity or affinity or altered specificity for a ligand or for multiple ligands. 

1 5 [247J Monomers or mnltimers can be screened for target binding activity in 

yeast cells using a two-hybrid screening assay. In this type of screen the monomer or 
multimer library to be screened is cloned into a vector that directs the formation of a fusion 
protein between each monomer or multimer of the hbrary and a yeast transcriptional activator 
firagment (i.e., Gal4). Sequences encoding the "target" protein are cloned into a vector that 

20 results in the production of a fljsion protein between fee target and the remainder of the Gal4 
protein (the DNA binding domain). A third plasmid contains a reporter gene downstream of 
the DNA sequence of the GaI4 binding site, A monomer that can bind to the target protein 
brings with it the Gal4 activation domain, thus reconstituting a functional Gal4 protein. This 
functional Gal4 protein bound to the binding site upstream of the reporter gene results in the 

25 expression of the reporter gene and selection of the monomer or multimer as a target binding 
protein, (see Chien et.al, (1991) Proc. Natl Acad Sci. (USA) 88:9578; Fields S. and Song O. 
(1989) Nature 340: 245) Using a two-hybrid system for library screening is further 
described in U.S. Patent No, 5,811,238 (see also Silver S.C. and Hunt S.W. (1 993) MoL Biol 
Rep. 17:155; Dmfee et al. (1993) Genes Devel 7:555; Yang et ah (1992) Science 257:680; 

30 Luban et al (1993) Cell 73:1067; Hardy et al (1992) Genes Devel 6:801; Bartel et al. (1993) 
Biotechniques 14:920; and Vojtek et ah (1993) Cell 74:205). Another useful screening 
system for carrying out the present invention is the ExolifBCC? interactive screening system 
(Germino et al. (1 993) Proc. Nat Acad, Scl (U,S A.) 90:993; Guarente (1993) Proc, Nat 

Acad. Sol q^.^A)9Q\imy 



wo 2006/009888 PCTm$2005/021558 
{2481 Other variations inctade the use of multiple binding compounds, such 
that monomer domains, mnltimers or libraries of these molecnies can be simultaneously 
screened for a multiphciiy of ligands or compounds that have different binding specificity. 
Multiple predetermined ligands or compounds can be concomitantly screened in a single 
library, or sequential screening against a number of monomer domains or multamers. In one 
variation, multiple ligands or compounds, each encoded on a separate bead (or subset of 
beads), can be mixed and incubated with monomer domains, multimers or libraries of these 
molecules under suitable binding conditions, The collection of beads, comprising multiple 
ligands or compounds, can then be used to isolate, by affinity selection, selected monomer 
domains, selected multimers or library members. Generally, subsequent affinity screening 
rounds can include the same mixture of beads, subsets thereof, or beads containing only one 
or two individual ligands or compounds. This approach affords efficient screening, anr' is 
compatible with labox^tory automation, batch processing, and high throughput screening 
methods. 

[249] In another embodiment, multimers can be simultaneously screened for 
the ability to bind multiple ligands, wherein each ligand comprises a different label For 
example, each ligand can be labeled with a different fluorescent label, contacted 
simultaneously with a multimer or multimer library, Multimers with the desired affinity are 
then identified (e.g., by FACS sorting) based on the presence of the labels linked to the 
desired labels. 

[2501 Libraries of either monomer domains or multimers (referred in the 
following discussion for convenience as "affinity agents^*) can be screened (i.e., panned) 
simultaneously against multiple ligands in a number of different formats. For example, 
multiple ligands can be screened in a simple mixture, in an array, displayed on a cell or tissue 
(e,g,, a cell or tissue provides numerous molecules that can be bound by the monomer 
domains or multimers of the invention), and/or immobilized. The libraries of affinity agents 
can optionally be displayed on yeast or phage display systems. Similarly, if desired, the 
ligands (e.g,, encoded in a cDNA library) can be displayed in a yeast or phage display 
system. 

[251] Initially, the affinity agent library is panned against the multiple 
ligands. Optionally, the resulting "hits^* are panned against the ligands one or more times to 
enrich the resulting population of affinity agents. 

[2521 If desired, the identity of the mdividual affinity agents md/oi ligands 
can be determined. In some embodiments, affinity agents are displayed on phage. Affinity 
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agents identified as binding in the iBitial screen are divided into a iSrst mi second portion. 
The first portion is infected into bacteria^ resulting in either plaques or bacterial colonies, 
depending on the type of phage used. The expressed phage are immobilized and then probed 
with iigands displayed in phage selected as described below. 
5 [253] The second portion are coupled to beads or otherwise immobilized and 

a phage display library containing at least some of the Iigands in the original mixture is 
contacted to the inomobilized second portion. Phage that bind to the second portion are 
subsequently eluted and contacted to the immobilized phage described in the paragraph 
above. Phage-phage interactions are detected (e,g*j using a monoclonal antibody specific for 

10 the ligand-expressing phage) and the resulting phage polynucleotides can be isolated. 

[254] In some embodiments^ the identity of an affinity agent-ligand pair is 
determined. For example, when both the affinity agent and the ligand are displayed on a 
phage or yeast, the DNA j&om the pair can be isolated and sequenced. In some embodiments^ 
polynucleotides specific for the ligand and affinity agent are amplified. Amplification 

1 5 , primers for each reaction can include 5' sequences that are complementary such that the 
resulting amplification products are fused, thereby forming a hybrid polynucleotide 
comprising a polynucleotide encoding at least a portion of the affinity agent and at least a 
portion of the ligand. The resulting hybrid can be used to probe affinity agent or Hgand (e.g., 
cDKA-encoded) polynucleotide libraries to identify both affinity agent and ligand. 

20 [255] The above-described methods can be readily combined with ''walking" 

to simultaneously generate and identify multiple multimers, each of which bind- to a ligand in 
a mixture of Iigands, In these embodiments, a first library of affinity agents (monomer 
domains or multimers) are panned against multiple Iigands and the eluted affinity agents are 
linked to the first or a second library of affinity agents to form a library of multimeric affinity 

25 agents (e.g., comprising 2, 3, 4, 5, 6, 7 ^ 8, 9, or more monomer), which ai-e subsequently 
panned against the multiple Iigands. This method can be repeated to continue to generate 
larger multimeric affinity agents. Increasing the number of monomer domains may result in 
increased affinity and avidity for a particular target. For example, the inventors have found 
that trimers of monomer domains that bind CD28 have a higher affinity than dimmers, which 

30 in turn have a higher affinity than single CD28-bindrng monomer domains alone. Of course, 
at each stage, the panning is opttonaliy repeated to enrich for significant binders. In some 
cases, walking will be facilitated by inserting recombination sites (e.g.^ lox sites) at the ends 
of monomers and recombining monomer libraries by a recombinase-mediated event. 
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[256] The selected multimers of the above methods can he &rther 
manipulated, e.g., by recombining or shufflmg the selected multimers (recombination caa 
occjur between or within multimers or both), mutating the selected multimers, and the like. 
This results in altered multimers which then can be screened and selected for members that 
have an enhanced property compared to the selected multimer, thereby producing selected 
altered multimers. 

[257] In view of the description herein, it is clear that the following process 
may be followed, Naturally or non-naturally occurring monomer domains may be 
recombined or variants may be formed. Optionally the domains initially or later are selected 
for those sequences that are less likely to be immunogenic in the host for which they are 
intended. Optionally, a phage library comprising the recombined domains is panned for a 
desired affinity. Monomer domains or multimers expressed by the phage may be screened 
for IC50 for a target. Hetero- or homo-meric multimers maybe selected. The selected 
polypeptides may be selected for their affinity to any target, including, e,g., hetero- or homo- 
multimeric targets. 

[258] Linkers, multimers or selected multimers produced by the metiiods 
indicated above and below are features of the present invention. Libraries comprising 
multimers, e,g, a library comprising about 100, 250, 500 or more members produced by the 
methods of the present invention or selected by the methods of the present invention are 
provided. In some embodiments, one or more cell comprising members of the libraries, are 
also included. Libraries of the recombinant polypeptides are also a feature of the present 
invention, e.g., a library comprising about 100, 250, 500 or more different recombinant 
polypetides. 

[259] Compositions of the present invention can be bound to a matrix of an 
affinity material, e.g., the recombinant polypeptides. Examples of affinity material include, 
e.g., beads, a column, a soHd support, and/or the like, 

VII THERAPEUTIC Am PROPHYLACTIC TREATMENT METHODS 

{260] The present invention also includes methods of therapeutically or 
prophylactically treating a disease or disorder by administering in vivo or ex vivo one or more 
nucleic acids or polypeptides of the invention described above (or compositions comprising a 
phaimaceutically acceptable excipient and one or more such nucleic acids or polypeptides) to 
a subject, including, e.g., a manomal, including a human, primate, mouse, pig, cow, goat, 
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rabbit, rat, guinea pig, hajtnsterj horse^ sheep; or a non-mammalian vertebrate such as a bird 
(e.g., a chicken or duck)^ fish, or invertebrate, 

[261] c-MET antagonists J including c-MET-binding monomer domains or 
multimers of the invention, axe nsefiil in treatment of human cancers expressing c~MET. A 

5 compendium of human cancers known to express c-MET and/or its Hgand HGF can be found 
in Table 1, p. 922 of Birchmeierj C, Birchmeier, W., Gherardi^ E. & Vande Woude, G. F, 
Met, metastasis, motility and more. Nat Rev Mol Cell Biol 4, 915-25 (2003). c^MET 
antagonists are of therapeutic value in all of these cancers. More paxticnlarlyj c-MET 
antagonists are nsefal in meeting a significant \jnmet medical need in pancreatic cancer, 

1 0 mesothelioma, myeloma, head and neck cancer, lung (NSCLC) cancer, ovarian cancer, breast 
cancer, prostate cancer, colon cancer, glioblastoma and osteosarcoma. Other exemplary 
cancers include bladder^ breast, cervical, colorectalj oesophageal, gastric, kidneyj liver, lung, 
nasopharyngeal, gall bladder, prostate or thyroid cancer, osteosarcoma, synovial sarcoma, 
rhabdomosarcoma, MFH/fibrosarcoma, Kaposi's sarcoma, multiple myeloma, lymphomas^ 

1 5 adult T-cell leukemia, glioblastomas, astrocytomas^ melanoma, and Wilm's tumor, 

[262] Individuals can be treated, for example, by once weekly intravenous 
injections of a soluble formulation of a c-MET antagonist composed of c-MET-binding 
monomer domains or multimers of the invention, optionally in combination with one or more 
additional therapeutic entities, for example either biologic or chemotherapeutic, 

20 (263] In one aspect of the invention, in ex vivo methods, one or more cells or 

a population of cells of interest of the subject tumor cells^ tumor tissue sample, organ 
cells, blood cells, cells of the skin, lung, heart, muscle, brain, mucosae, liver, intestine, 
spleen, stomach, lymphatic system, cervix, vagina, prostate, mouth, tongue, etc.) are obtained 
or removed from the subject and contacted with an amount of a selected monomer domain 

25 and/or multimer of the invention that is effective in prophylacticaliy or therapeutically 
treating the disease, disorder, or other condition. The contacted cells are then returned or 
delivered to the subject to the site from which they were obtained or to another site {^.g,, 
including those defined above) of interest in the subject to be treated. If desired, the 
contacted cells can be grafted onto a tissue, organ, or system site (including all desciibed 

30 above) of interest in the subject using standard and well-known grafting techniques or, e.g., 
delivered to the blood or lymph system using standard delivery or trans&sion techniques. 

[264] The invention also provide in vivo methods in which one or more cells 
or a population of cells of interest of the subject are contacted directly or indirectly with an 
amount of a selected monomer domain and/or multimer of the invention effective in 
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prophylactically or Iherapeutically treating the disease, disorder, or other condition. In direct 
coutact/administration formats, iiie selected mononaer domain and/or multimer is typically 
administered or transferred directly to the cells to be treated or to fUe tissue site of interest 
(e.g., tumor cells, tumor tissue sample, organ cells, blood cells, cells of the skin, tang, heart, 
muscle, brain, mncosae, liver, intestine, spleen, stomach, lymphatic system, cervix, vagina, 
prostate, month, tongue, etc.) by any of a variety of formats, including topical administration, 
injection (e.g., by nsing a needle or syringe), or vaccine or gene gon dehvery, pushing into a 
tissue, organ, or skin site. The selected monomer domain and/or multimer can be ddivered, 
for example, intramuscularly, intradermally, subdermally, subcutaneously, orally, 
intraperitoneally, intrathecally, intravenously, or placed within a cavity of the body 
(including, e.g., during surgery), or by inhalation or vaginal or rectal administration. 

[2651 '^^ v^'™ i^idirect contact/administration formats, the selected 
monomer domain and/or multimer is typically administered or tcansferred indirectly to the 
cells to be treated or to the tissue site of interest, including those described above (such as, 
e.g., skin cells, organ systems, lymphatic system, or blood cell system, eta), by contacting or 
administering the polypeptide of the invention directly to one or more cells or population of 
cells from which treatment can be facilitated. For example, tumor cells within the body of 
the subject can be treated by contacting cells of the blood or lymphatic system, skin, or an 
organ with a sufficient amount of the selected monomer domain and/or multimer such that 
delivery of the selected monomer domain and/or multimer to the site of interest (e.g., tissue, 
organ, or cells of interest or blood or lymphatic system within the body) occurs and effective 
prophylactic or therapeutic treatment results. Such contact, administration, or transfer is 
typically made by using one or more of the routes or modes of administration described 
above. 

[2661 In another aspect, the invention provides ex vivo methods in which one 
or more cells of interest or a population of cells of interest of the subject (e.g., tumor cells, 
tumor tissue sample, organ cells, blood cells, cells of the sMn, lung, heart, muscle, brain, 
mucosae, liver, intestine, spleen, stomach, lymphatic system, cervix, vagina, prostate, mouth, 
tongue, etc.) are obtained or removed from the subject and transformed by contacting said 
one or more cells or population of cells with a polynucleotide construct comprising a nucleic 
acid sequence of the invention that encodes a biologically active polypeptide of interest {e.g., 
a selected monomer domain and/or multimer) that is effective in prophylactically or 
therapeutically treating the disease, disorder, or other condition. Tlie one or more cells or 
population of cells is contacted witih a sufficient amount of the polynucleotide construct and a 
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promoter controlling expression of said nticleic acid sequence such tliat uptake of the 
polynucleotide construct (and promoter) into the cell{s) occurs and sufficient expression of 
the target nucleic acid sequence of the invention results to produce an amount of the 
biologically active polypeptide^ encoding a selected monomer domain and/or multimerj 
effective to prophylactically or therapeutically treat the disease, disorder, or condition. The 
polynucleotide construct can include a promoter sequence (^r,^,^ CMV promoter sequence) 
that controls expression of the nucleic acid sequence of the invention and/or, if desired, one 
or more additional nucleotide sequences encoding at least one or more of another polypeptide 
of the invention, a cytokine^ adjuvant, or co-stimulatory molecule, or other polypeptide of 
interest, 

|267] Following transfection, the transformed cells are returned, delivered^ or 
transferred to the subject to the tis'^ue site or system from which they were obtained or to 
another site (e.g., tumor cells, tumor tissue sample, organ cells, blood cells, cells of the sMb, 
lung, heart, muscle, brain, mucosae, hver, intestine, spleen, stomach, lymphatic system, 
cervix, vagina, prostate, mouth, tongue, eta) to be treated in the subject If desired, the cells 
can be grafted onto a tissue, skin, organ, or body system of interest in the subject using 
standard and welWcnown grafting techniques or delivered to the blood or lymphatic system 
using standard delivery or transfiision techniques. Such delivery, administration, or transfer 
of transfonned cells is typically made by using one or more of the routes or modes of 
administi-ation described above. Expression of the target nucleic acid occurs naturally or can 
be induced (as described in greater detail below) and an amount of the encoded polypeptide is 
expressed sufficient and effective to treat the disease or condition at the site or tissue system. 

[268] In another aspect, the invention provides in vivo methods in which one 
or more celts of interest or a population of cells of the subject (e.g., including those cells and 
cells systems and subjects described above) are transformed in the body of the subject by 
contacting the cell(s) or population of cells with (or administering or transferring to the cen(s) 
or population of cells using one or more of the routes or modes of administration described 
above) a polynucleotide construct comprising a nucleic acid sequence of the invention that 
encodes a biologically active polypeptide of interest (e.g., a selected monomer domain and/or 
multimer) that is effective in prophylactically or therapeutically treating the disease, disorder, 
or other condition. 

(2691 The polynucleotide construct can be directly administered or 
transferred to cell(s) suffering from the disease or disorder (e,g., by direct contact using one 
or more of the routes or modes of administration described above). Alternatively, the 
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polynuoieotide construct can be indirectly administered or transferred to ceU(s) suffering 
from the disease or disorder by first directly contacting non-diseased cell(s) or other diseased 
cells using one or more of the rotJtes or modes of administration described above with a 
sufficient amonnt of the polymicleotide construct comprising the nncleic acid sequence 
encoding the biologically active polypeptide, and a promoter controlhng expression of the 
nucleic acid sequence, such that uptake of the polynucleotide construct (and promoter) into 
the cell(s) occurs and snfficient expression of the nucleic acid sequence of the invention 
results to produce an amount of the biolo^cally active polypeptide effective to 
prophylactically or therapeutically treat the disease or disorder, and whereby the 
polynucleotide construct or the resulting expressed polypeptide is transferred naturally or 
automatically from the initial delivery site, system, tissue or organ of the subject's body to 
the diseased site, tissue, organ or system of the subjec^t's body (e.g., vi? the blood or 
lymphatic system). Expression of the target nucleic acid occurs naturally or can be induced 
(as described in greater detail below) such that an amount of expressed polypeptide is 
snfficient and effective to treat the disease or condition at the site or tissue system. The 
polynucleotide construct can include a promoter sequence {e.g., CMV promoter seqnence) 
that controls expression of the nucleic acid sequence and/or, if desired, one or more 
additional nncleotide sequences encoding at least one or more of another polypeptide of the 
invention, a cytokine, adjuvant, or co-stimulatory molecule, or other polypeptide of interest. 

[270| In each of the in vivo and ex vivo treatment methods as described 
above, a composition comprising an excipient and the polypeptide or nucleic acid of the 
invention can be administered or delivered. In one aspect, a composition comprising a 
phatmaceutically acceptable excipient and a polypeptide or nucleic acid of the invention is 
administered or delivered to the subject as described above in an amount effective to treat the 
disease or disorder, 

(271] In another aspect, in each in vivo and ex vivo treatment method 
described above, the amount of polynucleotide administered to the cell(s) or snbject can be an 
amonnt such tliat nptake of said polynucleotide into one or more cells of the subject occurs 
and sufficient expression of said nucleic acid sequence results to produce an amount of a 
biologically active polypeptide effective to enhance an immnne response in the subject, 
including an immnne response induced by an immunogen (e.g. , antigen), hi another aspect, 
for each such method, the amount of polypeptide administered to cell(s) or subject can be an 
amount sufficient to enhance an inmmne response in the subject, including that induced by an 
immnnogen {e.g., antigen). 
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[272] In yet another aspect, in an in vivo or in vivo treatment method in 
which a polyn-acleotide constiiict (or composition comprising a polynucleotide constract) is 
used to deliver a physiologically active polypeptide to a subject^ the expression of the 
polynucleotide construct can be induced by "using an inducible on- and off-gene expression 
5 system. Examples of such on- and off-gene expression systems include the Tet-On'^'^ Gene 
Expression System and Tet-OfF^ Gene Expression System {see, e.g., Clontech Catalog 
2000, pg. 1 10-1 i 1 for a detailed description of each such system), respectively. Other 
controllable or inducible on- and off-gene expression systems are known to those of ordinary 
skill in the art. With such system, expression of the target nucleic of the polynucleotide 

1 0 construct can be regulated in a precise, reversible, and quantitative manner. Gene expression 
of the target nucleic acid can be induced, for examplej after the stable transfected cells 
containing the polynucleotide construct comprising the target nucleic acid are delivered or 
transferred to or made to contact the tissue site, organ or system of interest. Such systems are 
of particular benefit in treatment methods and formats in which it is advantageous to delay or 

1 5 precisely control expression of the target nucleic acid (e.g., to allow time for completion of 
surgery and/or heahng following surgery; to allow time for the polynucleotide constnict 
comprising the target nucleic acid to reach the site, cells^ system, or tissue to be treated; to 
allow time for the graft containing cells transformed with the construct to become 
incorporated into the tissue or organ onto or into which it has been spliced or attached, etc.), 

20 

mi ADDITIONAL MULTIMER USES 

[273] The potential applications of multimers of the present invention are 
diverse and include any use where an affinity agent is desired. 

[274] In some cases, a pair of monomers or multimers are selected to bind to 
25 the same target (i.e., for use in sandwich-based assays). To select a matched monomer or 
multimer pair, two different monomers or multimers typically are able to bind the target 
protein simultaneously. One approach to identify such pairs involves the following: 
(1 ) immobili^;ing the phage or protein mixture that was previously selected to bind the 
target protein 

30 (2) contacting the target protein to the immobilized phage or protein and washing; 

(3) contacting the phage or protein mixture to the bound target and washing; and 

(4) eluting the bound phage or protein without eluting the immobilized phage or 
protein. 
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[275] One use of the mtittimers or monomer domains of the inveation is use 
to replace antibodies or other affinity agents in detection or other affinity-based assays. Thus, 
in some embodiments, monomer domains or multimers are selected against the ability to bind 
components other than a target in a mixture. The general approach can inclxtde performing 

5 the affinity selection under conditions that closely resemble the conditions of the assay, 

including mimicking the composition of a sample during the assay. Thus, a step of selection 
could include contacting a monomer domain or multimer to a mixture not including the target 
ligand and selecting against any monomer domams or multimers that bind to the mixture. 
Thusj the mixtures (absent the target Hgaadj which could be depleted using an antibody, 

1 0 monomer domain or multimer) representing the sample in an assay (senmij blood, tissue, 
cellSj urine, semen, etc) can be used as a blocldng agent. Such subtraction is useM, e.g,, to 
create pharmaceutical proteins that bind to their target but not to other serum proteins or non- 
target tissues* 

[276] For example, the invention can be used in the application for creating 
15 antagonists, where the selected monomer domains or multimers block the interaction between 
two proteins, e.g., the a and p chains of Met and/or between Met and HGF. Optionally, the 
invention can generate agonists. For example, multimers binding two different proteins, e.g., 
enzyme and substrate, can enhance protein function, including, for example, enzymatic 
activity and/or substrate conversion, 
20 (277] In some embodiments, the monomer domains are used for Hgand 

inhibition, ligand clearance or ligand stimulation. Possible ligands in these methods, include, 
e.g., HGF. 

[278] If inhibition of ligand binding to a receptor is desired, a monomer 
domain is selected that binds to the ligand (e.g., HGF) at a portion of the ligand that contacts 
25 the ligand's receptor, or that binds to the receptor at a portion of the receptor that binds 
contacts the ligand, tliereby preventing the ligand-receptor interaction. The monomer 
domains can optionally be linked to a half-life extender, if desired. 

[279] Ligand clearance refers to modulating the half-life of a soluble ligand 
in bodily fluid. For example, most monomer domains, absent a half-life extender, have a 
30 short ha1f4ife. Thus, bindmg of a monomer domain to the ligand will reduce the half-life of 
the ligand, thereby reducing Hgand concentration by clearing the ligand through the kidney so 
long as the complex is no larger than the maximum size able to pass through the kidney (less 
than about 50 or 40 kD), The portion of the ligand (e.g., HGF) bound by the monomer 
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doirtain will generally not matter, though it may be beneficial to bind the ligand at the portion 
of the ligand that binds to its receptor (e.g., Met), thereby farther inhibiting the ligand's effect. 
This method is useful for reducing the concentration of any molecule in the bloodstream, 
[280] Alternatively, a multixner comprising a first monomer domain that 
5 binds to a half-life extender and a second monomer domain that binds to a portion of the 
ligand that does not bind to the ligand^s receptor can be used to increase the half-Hfe of the 
ligand. 

[281] In another embodiment, a maltimer comprising a first monomer 
domain that binds to the ligand and a second monomer domain that binds to the receptor can 
1 0 be used to increase the effective affinity of the Hgand for the receptor. 

[282] In anotlier embodiment, multimers comprising at least two monomers 
that bind to receptors axe used to bring two receptors into proximity by both binding the 
muhimer, thereby activating the receptors. 

[2831 Further examples of potential uses of the invention include monomer 
15 domains, and multimers thereof, that are capable of drug binding (e.g., binding 

radionucleotides for targeting, pharmaceutical binding for half-life extension of drugs, 
controlled substance binding for overdose treatment and addiction therapy), immune function 
modulating (e.g., irmnunogenicity blocking by binding such receptors as CTLA'-4j 
immunogenicity enhancing by binding such receptors as CDSOjOr complement activation by 
20 Fc type binding), and specialized delivery (e.g., slow release by linker cleavage^ 

eiectrotransport domains, dimerization domains, or specific binding to: cell entry domains, 
clearance receptors such as FcR, oral deUvery receptors such as plgR for trans-mucosal 
transport^ and blood-brain transfer receptors such as transferrinR). 

[284] In further embodiments, monomers or multimers can be linked to a 
25 detectable label (e,g,, Cy3, Cy5, etc.) or linlced to a reporter gene product (e.g., CAT, 
iuciferase^ horseradish peroxidase, alkaline phosphotase, GFP^ etc.). 

[285] Monomers or multimers of the invention that bind to Met may also be 
used in diagnostic and predictive applications in which is is useful to detect Met. For 
example, detection of Met can be used to predict prognosis of breast cancer, wherein higher 
30 abundance of Met than in a nonnal tissue indicates a poor prognosis. See, e.g.y U.S. Patent 
No. 6,673,559. 
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HL FURTHER MANIPULATING MONOMER DOMAINS AND/OR 

MULTIMER NUCLEIC ACIDS AND POLYPEPTIDES 

[286J As meiitioaed above, the polypeptide of the present invention can be 
altered. Descriptions of a variety of diversity generating procedijres for generating modified 
or altered nucleic add sequences encoding these polypeptides are described herein and the 

references cited thexein. 

1287] Another aspect of the present invention includes the cloning and 
expression of monomer domains, selected monomer domains, multimers and/or selected 
muWrners coding nucleic acids. Thus, muWraer domains can be synthesized as a single 
protein using expression systems well known in the art. General texts which describe 
molecular biological techniques useful herein, including the use of vectors, promoters and 
many other topics relevant to expressing nucleic adds such as monomer domains, selected 
monomer domains, multimers and/or selected multimers, include Berger and Kimmel, Guide 
to Moleculai- Clonin<^ TP^I^ni^i^ii..^ Methods in EnzvmoloKV volmoe 152 Academic Press, 
15 Jhc, San Diego, CA (Berger); Sambrook et al., Molecular Cloning - A Laboratory ManM 
(2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 
f Sambrook") and Oirrent Protocols in Molecular Biology , F.M. Ausubd et al., eds., Current 
Protocols, a joint venture between Greene Publishing Associates, hic. and John Wiley & 
Sons, foe, (supplemented through 1999) ("Ausubel")), Examples of techniques sufficient to 
20 direct persons of skill tlirough in vitro amplification methods, useful in identifying, isolating 
and doning monomer domains and multimers coding nudeic adds, inchiding the polymerase 
chain reaction (PCR) the ligase chain reaction (LCR), Q-ieplicase amplification and other 
UNA polymerase mediated techniques (e.g., NASBA), are found in Berger, Sambrook, and 
Ausubd, as well as Muffis et al, (1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide 
25 to Methods and Applications (bmis et al. eds) Academic Press Inc. San Diego, CA (1 990) 

(Innis); Amhdm & Levinson (October 1, 1990) C&EN 36-47; The Journal Of NIH Research 
(1991) 3, 81-94; (Kwoh et al (1989) Proc. Natl Acad Set USA 86, 1173; Guatelli et al 
(1990) Proc, Natl Acad. Sd. USA 87, 1874; Lomdl et al (1989) /. Clin. Chem 35, 1826; 
Landegren et al, (1988) Science 241, 1077-1080; Van Brunt (1990) Biotechnology 8, 291- 
30 294; Wu and Wallace, (1989) Gene 4, 560; Bairinger et al (1 990) Gene 89, 11 7, and 

Sooknmian and Malek (1995) Biotechnology 13: 563-564. hnproved methods of cloning in 
vitro amplified nuddc acids are described in Wallace et al, U.S. Pat. No. 5,426,039. 
Improved methods of amplifying large nucidc adds by PCR are summarized in Cheng et al 
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(1994) Nature 369: 684-685 and the references therein, in which PGR atnplicons of up to 
40kb are generated. One of skill will appreciate that essentially any RNA can be converted 
into a double stranded DNA suitable for restriction digestion, PGR expansion and sequencing 
using reverse transcriptase and a polymerase. See, Ausubel, Sambrook and Berger^ all supra. 

[288] The present invention also relates to the introduction of vectors of the 
invention into host cells, and the production of monomer domains, selected monomer 
domains, multimers and/or selected multimers of the invention by recombinant techniques. 
Host cells are genetically engineered (i.e., transduced, Ixansformed or ixansfected) with the 
vectors of this invention, which can be, for example, a cloning vector or an expression vector. 
The vector can be, for example^ in the form of a plasmid, a viral particle, a phage, etc. The 
engineered host cells can be cultured in conventional nutrient media modified as appropriate 
for activating promoters, selecting transformants, or amplifying the monomer domain, 
selected monomer domain, multimer and/or selected multimer gene(s) of interest. The 
culture conditions, such as temperature, pH and the like, are those previously used with the 
host cell selected for expression, and will be apparent to those skilled in the art and in the 
references cited herein, including, e.g., Freshney (1994) Culture of Animal Cells, a Manual of 
Basic Technique, tliird edition, Wiley- Liss, New York and the references cited ftierein, 

[289] As mentioned above, the polypeptides of the invention can also be 
produced in non-animal cells such as plants, yeast, fungi, bacteria and the like. Indeed, as 
noted throughout, phage display is an especially relevant technique for producing such 
polypeptides. In addition to Sambrook, Berger and Ausubel, details regarding cell culture 
can be found in Payne et al (1992) Plant Cell and Tissue Culture in Liquid Systems John 
Wiley & Sons, Inc. New York, NY; Gamborg and PhiHips (eds) (1 995) Plant Cell, Tissue 
and Organ Culture; Fundamental Methods Springer Lab Manual, Springei'- Verlag (Berlin 
Heidelberg New York) and Atlas and Parks (eds) The Handbook of Microbiological Media 
(1993) ORG Press, Boca Raton, FL. 

[290] The present invention also includes alterations of monomer domains, 
immuno-domains and/or multimers to improve pharmacological properties, to reduce 
immunogenicity, or to facilitate the transport of the multimer and/or monomer domain into a 
cell or tissue (e.g., through the blood-brain barrier, or through the skin). These types of 
alterations include a variety of modifications (e.g., the addition of sugar-groups or 
glycosylation), the addition of PEG, the addition of protein domains that bind a ceitain 
protein (e.g., HSA or other serum protein), the addition of proteins fragments or sequences 
that signal movement or transport into, out of and through a cell Additional components can 
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also be added to a multimer and/or monoxner domain to manipulate fee properties of the 
multimer and/or monomer domain, A variety of components can also be added including, 
e.g., a domain that binds a known receptor (e.g., a Fc-region protein domain that binds a Fc 
receptor), a toxin(s) or part of a toxin, a prodomain that can be optionally cleaved off to 
activate the multimer or monomer domain, a reporter molecule (e.g., gre^ fluorescent 
protein), a component that bind a reporter molecule (such as a radionuclide for radiotherapy, 
biotin or avidin) or a combination of modifications. 

X ANIMAL MODELS 

[291] Another aspect of &e invention is the development of specific non- 
human animal models in which to test fee immunogenicity of the monomer or multhner 
domains. The method of producing such non-human animal model comprises: introducing 
into at least some cells of a recipient non-human animal, vectors comprising genes encoding 
a plurality of human proteins from the same family of proteins, wherein the genes are each 
operably linked to a promoter that is fimctional in at least some of the cells into which the 
vectors are introduced such that a genetically modified non-human animal is obtained that 
can express the plurality of human proteins from the same family of proteins, 

[292] Suitable non-human animals employed in the practice of the present 
invention include all vertebrate animals, except huiuans (e.g., mouse, rat, rabbit, sheep, and 
the like). Typically, the pluraUty of members of a family of proteins includes at least two 
members of that family, and usually at least ten family members. In some embodiments, the 
plurality includes all laiown members of the family of proteins. Exemplary genes that can be 
used include those encoding monomer domains, such as, for example, members of the LDL 
receptor class A-domain family, the EGF-like domain family, as well as the other domain 
families described herein. 

[293] The non™human animal models of the present invention can be used to 
screen for hraiunogemcity of a monomer or multimer domain that is derived from the same 
family of proteins expressed by the non-human animal model The present invention 
includes the non-human animal model made in accordance with the method described above, 
as well as transgenic non-human animals whose somatic and germ cells contain and express 
DNA molecules encoding a plurality of human proteins from the same family of proteins 
(such as the monomer domains described herein), wherein the DNA molecules have been 
introduced mto the transgenic non-human animal at an embryonic stage, and wherein the 
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DNA molecules are each operably liiiJked to a promoter in at least some of the cells in which 
the DNA molecules have been introduced. 

[294] An example of a mouse model useful for screening LDL receptor class 
A-domain derived binding proteins is described as follows. Gene clusters encoding the wild 
type human LDL receptor class A-^domain monomers are amplified from human cells using 
PGR. Almost all of the 200 different A-domains can be amplified with only three separate 
PGR amplification reactions of about 7kb each. These fragments are then used to generate 
transgenic mice according to the method described above. The transgenic mice will 
recognize the human A-domains as "self ^ thus mimicking the "selfaess" of a human with 
regard to A-domains. Individual A-domain-derived monomers or multimers are tested in 
these mice by injecting the A-domain-derived monomei-s or multimers into the mice, then 
analyzing the immune response (or lack of response) generated. The mice are tested to 
determine if they have developed a mouse anti-human response (MAHR). Monomers and 
multimers that do not result in the generation of a MAHR are likely to be non^-immunogenic 
when administered to humans. 

[295] Historically, MAHR test in transgenic mice is used to test individual 
proteins in mice lhat are transgenic for that single protein. In contrast, the above described 
method provides a non-human animal model that recognizes an entire family of human 
proteins as "self," and that can be used to evaluate a huge number of variant proteins that 
each are capable of vastly varied binding activities and uses. 

XI SITS 

[296] Kits comprising the components needed in the methods (typically in an 
unmixed form) and kit components (packaging materials, instructions for using the 
components and/or the methods, one or more containers (reaction tubes^ columns, etc,)) for 
holding the components are a feature of the present invention. Kits of the present invention 
may contain a multimer library, or a single type of monomer or multimer. Kits can also 
include reagents suitable for promoting target molecule binding, such as buffers or reagents 
that facilitate detection, including detectably-labeled molecules. Standards for calibrating a 
ligand binding to a monomer domain or the like, can also be included in the kits of the 
invention, 

[297J The present invention also provides commercially valuable binding 
assays and kits to practice the assays. In some of the assays of the invention, one or more 
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ligand is employed to detect binding of a monomer domain, iuimmo-domains and/or 
m-ultimen Snch assays aie based on any known method in the art, e,g.j flow cytometry, 
fluorescent microscopy, plasmoB resonance, and the like, to detect binding of a ligand{s) to 
the monomer domain and/or mnltimer. 

[298] Kits based on the assay are also provided. The kits typically include a 
container, and one or more ligand. The kits optionally comprise directions for performing the 
assays, additional detection reagents, buffers, or instructions for the ijse of any of these 
components, or the like. Alternatively, kits can include cells, vectors, (e.g,, expression 
vectors, secretion vectors comprising a polypeptide of the invention), for the expression of a 
monomer domain and/or a mnltimer of the invention. 

[299] hi a further aspect^ the present invention provides for the use of any 
composition, monomer domain, immnno-domain, mnltimer, cell, cell culture, apparatus, 
apparatus component or kit herein, for the practice of any method or assay herein, and/or for 
the use of any apparatus or kit to practice any assay or method herein and/or for the nse of 
cells, cell cultures, compositions or other features herein as a therapeutic formnlation. The 
manufacture of all components herein as therapeutic formulations for the treatments 
described herein is also provided. 



XIL INTEGRATED SYSTEMS 

[300] The present invention provides computers, computer readable media 
and integrated systems comprising character strings corresponding to monomer domains, 
selected monomer domains, multimers and/or selected multhners and nucleic acids encoding 
sxtch polypeptides. These sequences can be manipulated by in silico recombination methods, 
or by standard sequence alignment or word processing softwai'e. 

[301] For example, different types of similarity and considerations of various 
stringency and character string length can be detected and recognized in the integrated 
systems herein. For example, many homology detennination methods have been designed 
for comparative analysis of sequences of biopolymers, for spell checking in word processing, 
and for data retrieval from various databases. Witii an understanding of double-helix pair- 
wise complement interactions aanong 4 principal nucleobases in natural polynucleotides, 
models that simulate annealing of complementary homologous polynucleotide strings can 
also be used as a foundation of sequence alignment or other operations typically performed 
on the character strings corresponding to the sequences herein (e.g., word-processing 



69 



wo 2006/&09888 FCTAJS2005/021558 

maitipulations, construction of figures comprising sequence or subsequence character strings, 
ontpnt tables, etc.). An example of a software package with GOs for calcailating sequence 
similarity is BLAST, which can be adapted to the present invention by inputting character 
strings con-esponding to the sequences herein. 

[302] BLAST is described in Altschul et aU (1990) J.MoLBioL 215:403- 
41 0. Software for performing BLAST analyses is publicly available through the National 
Center for Biotechnology Information (available on the World Wide Web at 
ncbi.nlm.nih.gov). This algorithm involves first identifying high scoring sequence pairs 
(HSPs) by identifying short words of length W in the query sequence, which either match or 
satisfy some positive-valued threshold score T when aligned with a word of the same length 
in a database sequence. T is referred to as the neighborhood word score threshold (Altschul 
et aL, supra). These initial neighborhood word hits act as seeds for initiating searches to find 
longer HSPs containing them. The word hits are then extended in both directions along each 
sequence for as far as the cumulative alignment score can be increased. Cumulative scores 
are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of 
matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). 
For amino acid sequences, a scoring matrix is used to calculate the cumulative score. 
Extension of the word hits in each direction are halted when: the cumulative alignment score 
falls off by the quantify X from its maximum achieved value; the cumulative score goes to 
zero or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X 
determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide 
sequences) uses as defaults a wordlengfh (W) of 11, an expectation (E) of 10, a cutoff of 100, 
]Vi^5^ and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the 
BLOSUM62 scoring matrix {see Henikoff & Henikoff (1989) Proc.Natl Acad. Sci USA 
89:10915), 

[303J An additional example of a useful sequence alignment algorithm is 
PILEUP. PILEUP creates a multiple sequence ahgnment from a group of related sequences 
using progressive, pairwise alignments. It can also plot a tree showing the clustering 
relationships used to create the alignment. PILEUP uses a simplification of the progressive 
alignment method of Feng & Doolittie, (1987) J. MoL BvoL 35:351^360. The method used is 
similar to the method described by Higgins & Shaip, (1989) CABIOS 5:15M53. The 
program can align, e.g., up to 300 sequences of a maximum length of 5,000 letters. The 
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multiple aligwneat procedure begins with the pairwise aligraneut of the two most similar 
sequences, producing a cluster of two aligned sequences. This cluster can then be aligned to 
the next most related sequence or cluster of aligned sequences. Two clusters of sequences 
can be aligned by a simple extension of the pairwise alignment of two individual sequence. 
The final alignment is achieved by a series of progressive, pairwise aJigoments. The program 
can also be used to plot a dendogram or tree representation of clustering relationships. The 
program is run by designating specific sequences and their amino acid or nucleotide 
coordinates for regions of sequence comparison. For example, in order to detennine 
conserved amino acids in a monomer domain family or to compare the sequences of 
monomer domains in a family, the sequence of the invention, or coding nucleic acids, are 
aligned to provide structure-function information. 

1304] In one aspect, the computer system is used to perform "in silico" 
sequence recombination or shuffling of character strings corresponding to the monomer 
domains. A variety of such methods are set forth in "Methods For Making Character Strings, 
Polynucleotides & Polypeptides Having Desired caiaracteristics" by Selifonov and Stemmer, 
filed February 5, 1999 (USSN 60/1 1 8854) and "Methods For Making Character Strings, 
Polynucleotides & Polypeptides Having Desired Characteristics" by Selifonov and Stemmer, 
filed October 12, 1 999 (USSN 09/416,375). hi brief, genetic operators are used in genetic 
algorithms to change given sequences, e.g., by niimicking genetic events such as mutation, 
recombination, death and the like. Multi-dimensional analysis to optimize sequences can be 
also be performed in the computer system, e.g., as described in the '375 application. 

[305] A digital system can also instmct an oligonucleotide synthesizer to 
synthesize oligonucleotides, e.g., used for gene reconstraction or recombination, or to order 
oligonucleotides ftom commercial sources (e.g., by printing appropriate order forms or by 
linking to an order form on the internet). 

[306] The digital system can also include output elements for controlling 
nucleic acid synthesis {e.g., based upon a sequence or an aligiment of a recombinant, e.g., 
recombined, monomer domain as herein), i.e., an integrated system of the invention 
optionally includes an oligonucleotide synthesizer or an oligonucleotide synthesis controller. 
The system can include other operations that occur downstream from an alignment or other 
operation performed using a character string corresponding to a sequence herein, e.g., as 
noted above with reference to assays. 
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EXAMPLES 

[307] The following example is .offered to illustrate, but not to limit the 
claimed invention. 

Example 1 

[308] This example describes selection of monomer domains and the 
creation of mxiltimers. 

[309] Starting materials for identifying monomer domains and creating 
mnltimers from the selected monomer domains and procednres can be derived from any of a 
variety of hnman and/or non-himian sequences. For example, to produce a selected monomer 
domain with specific binding for a desired Hgand or mixture of ligands, one or more 
monomer domain gene(s) are selected from a family of monomer domains that bind to a 
certain ligand. The nucleic acid sequences encoding the one or more monomer domain gene 
can be obtained by PCR amplification of genomic DNA or cDNA, or optionally, can be 
produced synthetically using overlapping oligonucleotides, 

[310] Most coxmnonly, these sequences are then cloned into a cell surface 
display format {ie,, bacterial, yeast, or mammalian (COS) cell surface display; phage 
display) for expression and screening. The recombinant sequences are transfected 
(transduced or transformed) into the appropriate host cell where they are expressed and 
displayed on the cell surface. For example, the cells can be stained with a labeled (e.g., 
fluorescently labeled), desired ligand. The stained cells are sorted by flow cytometiy, and the 
selected monomer domains encoding genes are recovered (ag.j by plasmid isolation, PGR or 
expansion and cloning) from the positive cells* The process of staining and sorting can be 
repeated mnltiple times (e.g., using progressively decreasing concentrations of the desired 
ligand until a desired level of enrichment is obtained). Alternatively, any screening or 
detection method known in the art that can be used to identify cells that bind the desired 
ligand or mixture of ligands can be employed, 

[311] The selected monomer domain encoding genes recovered from the 
desired ligand or mixture of ligands binding cells can be optionally recombined according to 
any of the methods described herein or in the cited references. The recombinant sequences 
produced in this round of diversiiScation are then screened by the same or a different method 
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to identify recoxabinant genes with improved affinity for fhe desired or target Kgand. The 
diversification and selection process is optionally repeated imtil a desired affinity is obtained* 

[312] The selected monomer domain nucleic acids selected by the methods 
can be joined together via a linker sequence to create mnltimers, e.g,, by the combinatorial 
assembly of nucleic acid sequences encoding selected monomer domains by DNA ligation^ or 
optionally, PCR-based, self-priming overlap reactions. The nucleic acid sequences encoding 
the mnltimers are then cloned into a cell surface display format (;,e,j bacterial, yeast, or 
maxnmahan (COS) cell surface display; phage display) for expression and screening. The 
recombinant sequences are iransfected (transduced or transformed) into the appropriate host 
cell where they are expressed and displayed on the cell surface. For example, the cells can be 
stained with a labeled, e.g,, flnorescently labeled, desired Hgand or mixture of ligands. The 
stained cells are sorted by flow cytometry, and the selected multimers encoding genes are 
recovered (e.g., by PGR or expansion and cloning) fi-om the positive cells. Positive cells 
include multimers with an improved avidity or affinity or altered specificity to the desired 
ligand or mixture of ligands compared to the selected monomer domain(s). The process of 
staining and sorting can be repeated multiple times (e.g., using progressively decreasing 
concentrations of the desired ligand or mixtiire of ligands until a desired level of enrichment 
is obtained). Alternatively, any screening or detection method known in the art that can be 
used to identify cells that bind the desired Hgand or mixture of ligands can be employed. 

(313] The selected multimer encoding genes recovered from the desired 
ligand or mixture of ligands bindiag cells can be optionally recombined according to any of 
the methods described herein or in the cited references. The recombinant sequences 
produced in this round of diversification are then screened by the same or a different method 
to identify recombinant genes with improved avidity or affinity or altered specificity for the 
desired or target ligand. The diversification and selection process is optionally repeated until 
a desired avidity or affinify or altered specificity is obtained. 

Example 2 

[314] This example describes in vivo intra-protein recombination to generate 
libraries of greater diversity. 

[315] A monomer-encoding plasmid vector (pCK-derived vector; see below), 
flanked by orthologous loxP sites, was recombined in a Cre~dependent mamier with a phage 
vector via its compatible loxP sites. The recombinant phage vectors were detected by PCR 
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lising primers specific for tlie recombinant construct. DNA sequencing indicated that the 
correct recombinant product was generated. 

Reagents and experimental procedaires 

|316] pCK-cre-lox-Monomer-loxP. This vector has two particulaarly 
relevant features. First, it carries flie ere gene, encoding the site-specific DNA recombinase 
Cre, under flie contol of Piao- Cre was PCR-amphfied from p705-cre (from GmeBridges) 
with cre-specific primers that incorporated Jf&al (5') and Sfil (3') at the ends of the PGR 
product. This product was digested withal and SJil and cloned into the identical sites of 
pCK, a bid, Cm^ derivative of pCKl 10919-HC-Bla (pACYC on), yielding pCK-cre. 

1317) The second feature is the naive A domain library flanked by two 
orfhologcus loxP sites, /ojcP(wild-type) and /oxF(FAS), which are required for the site- 
specific DNA recombination catalyzed by Cre. See, e.g. , Siegel, R.W., et al . FEBS Letters 
505:467-473 (2001). Th^e sites rarely recombine with another. /oxP sites were biiilt into 
pCK-cre sequentially. S'-phosphorylated oligonucleotides IoxP(K) and loxP(K_rc), carrying 
loxPCWT) and EcoM and if/wDlD-compatible overhangs to allow ligation to digested EcoRl 
and FiwDin^digested pCK, were hybridized together and Ug^ted to pCK-cre in a standard 
ligation reaction (T4 ligase; ovemi^t at 160). 

[318] The resulting plasmid was digested with EcoRl and and ligated to 
the hybridized, 5'-phosphorylated oHgos loxP(L) and loxP (L^rc), which carry /oxP(FAS) 
and EcoRl and 5^M^compatible overhangs. To prepare for library construction, a large-scale 
purification (Qiagen MAXI prep) of pCaC-cre-loX"P(wt)-loxP(FAS) was performed according 
to Qiagen's protocol. The Qiagen-purified plasmid was subjected to CsCl gradient 
centrifugation for further purification. This construct w^ then digested with Sphl and Bglll 
and ligated to digested nai've A domain library insert, which was obtained via a PCR- 
amplification of a preexisting A domain library pool. By design, the loxP sites andmonomer 
are in-frame, which generates monomers witii /oxP-encoded linkers. This library was utilized 
in the in vivo recombination procedure as detailed below. 

[319] fUSESHA-Mononier-lox-lox vector. The vector is a derivative of 
fUSE5 from George Smith's laboratory (University of Missouri). It was subsequently 
modified to carry an HA tag for immunodetection assays. loxP sites were buUt into 
fUSE5HA sequentially. 5'phosphorylated oHgonucleotides loxP(I) a/M?loxP(I)_rc, carrying 
loxP(WT), a string of stop codons andZmal and 5/jI-compatible overhangs, were hybridized 
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together and ligated to Xmal- and Sfil-dig^sted fUSESHA in a standard ligation reaction 
(New England Biolabs T4 ligase; overnight at 16C). 

[320] The resulting phage vector was next digested with Xmal and Sphl and 
ligated to the hybridized oligos loxP(J) aod loxP(J)_re, which carry loxP(S AS) and overhangs 
5 compatible with Xmal and Sphl This construct was digested with XmaVSfll and then ligated 
to pre-cut {XmaVSfiT) naive A domain library insert (PGR product). The stop codons are 
located between the loxP sites, preventing expression of glll and consequently, the 
production of infectious phage. 

[321] The ligated vector/library was subsequently transformed into an E. coll 
1 0 host bearing a g/ff-expressing plasmid that allows the rescue of the fUSESHA-Monomer-lox- 
lox phages as detailed below. 

[3221 pCK-gJXr This plasmid carries glU under the control of its native 
promoter. It was constructed by PCR-amplifying ^77/ and its promoter from VCSM13 helper 
phage (Stratagene) with primers glllPromoter^EcoRI and gniPtomoter__HinDni This 
1 5 product was digested with EcoM and HinDlll and cloned into the same sites of pCKl 1 09 1 9- 
HC-Bla. As g/7/is under the control of its own promoter, glll expression is presumably 
constitutive. pCK-g777 was transformed into E. coU EC 100 (Epicentre). 

[323] In vivo recombmation procedure. In summary, the procedure 
involves the following key steps: a) Production of infective (i.e. r^cue) of fUSESHA- 
20 Monomer-loX"lox library with an E, coli host expressing glU from a plasmid; b) Cloning of 
2'''^ Hbrary (pCK) and transformation into TGI E. coli; c) Infection of the culture carrying 
the 2"^ library with the rescued fQSE5HA-Monomer4ox-Iox phage Hbrary. 

[324] a. Rescue of phage vector, Electrocompetent cells carrying pCK-g777 
were prepared by a standard protocol These cells had a transfonnation frequency of 4 x 
25 lOVfig DNA and were electroporated with large-scale ligations (-5 jag vector DNA) of 
jPUSE5HA-lox-lox vector and the naive A domain library insert. After individual 
electroporations (100 ng DNA/electroporation) with 70 juL cells/cuvette, 930 p.L warm 
SOC media were added, and the cells were allowed to recover with shaking at 37C for 1 
hom\ Next, tetracycline was added to a final concentration of 0.2 |utg/mLj aud the cells were 
30 shaken for - 45 minutes at 37C, An aliquot of this culture was removed, 1 0-fold serially 
diluted and plated to determine the resulting library size (1 .8 x 10^). The remaining culture 
was diluted into 2 x 500 mL 2xYT (with 20 jig/mL chloramphenicol and 20 jiig/mL 
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tetracycliQe to select for pCK-gffi and the ftJSE5HA-based vector, respectively) and grown 
oYeniight at 30C* 

{325] Rescued phage were harvested nsing a standard PEG/NaCl 
precipitation protocol The titer was approximately 1 x 10"^^ transducing imits/mL. 

5 [326] b. Cloning of the 2"^ library and tramformation into an E, colt host 

The ligated pCK/ naive A domain library is electroporated mto a bacterial F''" host, with an 
expected library size of approximately 10^. After m hoiur-long recovery period at 37C with 
shaking, the electroporated cells are diluted to ODeoo^ 0.05 in 2xYT (plus 20 f^g/mL 
chloramphenicol) and grown to mid-log phase at 37C before infection by fUSEHA- 

1 0 Monomer-lox-lox* 

1327] c. Infection of the culture carrying the library with the rescued 
fUSE5HA-Monomer4ox4ox phage library. To maximize the generation of recombinants^ a 
high infection rate (> 50%) of RcoU within a culture is desirable. The infectivity ofB. coli 
depmds on a nnmber of factors, including the expression of the F pilus and growth 

15 conditions. E. coli backgrounds TGI (carrying an F') and K91 (an Hfr strain) were hosts for 

the recombination system, 

Oli^ojnacleotides 
loxP(K) 

[P-5' agcttataacttcgtatagaaaggtatatacgaagttatagatctcgtgctgcatgcggtgcg] 

20 

loxP(K_„rc) 

[P-5' aattcgcaccgcatgcagcacgagatctataacttcgtatatacctttctatacgaagttataagct] 
loxP{L) 

25 [F-5' ataacttcgtatagcatacattatacgaagttatcgag] 

loxP {L_„rc) 

[P-5'' ctcgataacttcgtataatgtatgctatacgaagttatg] 
30 loxPCO 

[PS'ccgggagcagggcatgctaagtgagtaataagtgagtaaataacttcgtatatacctttctatacgaagttatcgtctg] 
loxP(I)_rc 

[P-S' acgataacttcgtatagaaaggtatatacgaagttatttactcacttattactcacttagcatgccctgctc] 

35 

loxP(J) 

[5 ' cogggaccagtggcctctggggccataacttcgtatagcatacattatacgaagttatg] 
loxP(J)_rc 

40 [5' cataacttcgtataatgtatgctatacgaagttatggccccagaggccactggtc] 

gIIIPromoter_EcoRI 
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[5' atggcgaattctcattgtcggcgcaactat 

gIIIPromoter_HinDin 

[5* gataagctttcattaagactccttattacgcag] 



15 



20 



25 



30 



35 



Example 6 



library. 



[328] This example describes construction of an EGF-based monomer 



1329] The CaEGF domain library, E3, encodes a protein domain of 36-43 

1 0 amino acids having the following pattern: 

X(5)C1-X(4/6)^C2-X(4,5)-C3^X(8)-C4^X(1>C5-X{8/12)-C6 

(330] The table below describes for each position which amino acids ai'e 
encoded in the library based upon the natural diversity of human calcium binding EGF 
domains: 
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1331] The library of DNA sequences, E3, encoding monomeric calcinm 
binding EGF domains, was created by assembly PGR as described m Stemmer et al. Gene 
164:49-53 (1995). The oligonucleotide nsed in this PGR reaction are in two groups, 1 and 2, 
They are: 

Group 1 : 

1 5'"AAAAGGCCTCGAGGGCCTGGGTGGCAATGGT-3' 

2 S'^CCTGAACCACCACAKHKACCGYICSNBGCACGGAYYCGRCRMACATTC 

ATYAAYATCTDYACCATTGCCACCC-3' . . . ^ . 

3 5^.CCTGAACCACCACAia^TGSCGYYGYKMHSGCACGGAYYCGRCRMACATTC 

AT^AAYATCTDYACCATTGCCACCC"3^ 
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4.5'^CCTGAACCACCACAKHKACCGYKSNBGCAARBAYBCGVAHyCWSKBYAC 
ATTCATYAAYATCTDYACCATTGCCACCC-3' 

5,5'"CCTGAACCACCACAKNTGSCGYYGYKMHSGCAARBAYBCGVAI-rYCWSKBY 
ACATTCATYAAYATCTDYACCATTGCCACCC-3' 
5 6. S^TGAATTTTCTGTATGAGGTTTTGCTAAACAACTTrCAACAGTTTCGGCCC 
CAGAGGCCCTGGAGCCACCTGAACCACCACA-3' 

Group 2: 

1 . S^-'ACGGTGCCTACCCGTATGATGTTCCGGATTATGCCCCG 
10 GGTGGCAATGGT-3^ 

2. 5'^CCTGAACCACCACAGHKTDBACCGGHAWAGCCTKSCRSGCASHBACAK 
YTCAWAGCYACCCDSTRWATYTWBACCATTGCCACCC-3^ 

3. 5'-CCTGAACCACCACAiCBYKBTKCYGKYCBSABYCNGCDBAWAGCCTK 
BGBKGCASHBACAKYKAWAGCYACCCDSmWATYTWBACCATTGCCACCC^S^ 

15 4. 5^-AAAAGGCCCCAGAGGCCCCTGAACCACCACA~3* 

where R=A/G, Y-C/T, M-A/C, K-G/T, S=C/G, W-^ATT, B-C/G/T, D-A/G/T, H^-A/C/T, 

V-A/C/G, andN=A/C/G/T. 

[332] Following the separate PCRs of the Group 1 and 2 oligonudeotides, 

the Group 1 PGR j&agments were digested with Bpml and group 2 PGR fragments were 

20 digested with BsrDL Digestion products were purified using Qiagen Qiaqmck columns and 

then Hgated together. The hgated DNA was then amplified in a PGR using two primers. 

These are; 



5'-AAAAGGCCTCGAGGGCCTGGGTGGCAATGGT-3' 
25 5^"AAAAGGCCCCAGAGGCCCCTGAACCACCACA"3' 



{333] The PGR products were purilBed with Qiagen Qiaquick columns and 
digested with SfiL The digested product was purified with Qiagen Qiaquick columns. The 
DNA fragments were Hgated into the Sfil restriction sites of phage display vector fase5- 

30 H A(G4S)4, a derivative of fuseS caixying an in-frame HA-epitope and a glycine^ serine 
flexible linlcer. The Ugation mixture was electroporsted into TransforMax™ EC! 00™ 
eiectrocompetent E. coli cells, Transfoimed E. coli cells were grown overnight at 3TC in 
2xYT medium containing 20 ]xg/ml tetracycline. The resulting library contained 2X1 0^ 
independent clones. Phage particles were purified from the culture medium by PEG- 

35 precipitation. The titer of the phage was 13X10'^/nil, The sequences of 24 individual clones 
were determined and these were consistent with the library design. 
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Example 3 

[334] This example describes construction of an EGF-based monomer 

library. 

[335] Recombination can be used for intradomain optimization, FOr 
example a PGR overlap reaction can be -used that recombines two or more segments of a 
single domain relative to each other. One can use two, Chxee, four, five or more fragment 
overlap reactions in the same way as illustrated. This recombination process has many 
applications. One application is to recombine a large pool of hundreds of previously selected 
clones wittiout sequence information. All that is needed for each overlap to work is one 
known region of (relatively) constant sequence that exists in the same location in each of the 
clones (fixed site approach). For A domains, typically these clones would have been derived 
from a library in which 20-25 amino acids distribitted over all five mter-cysteine segments 
were randomized. The intra-domain recombination method can also be performed on a pool 
of sequence-related monomer domains by standard DNA recombination {e.g., Stenomer, 
Nature 370:389-391 (1994)) based on random fragmentation and reassembly based on DNA 
sequence homology, which does not require a fixed overlap site in all of the clones that are to 
be recombined. 

[336) Another application of feis process is to create multiple separate, naive 
(meanmg nnpanned) libraiies in each of which only one of the intercysteine loops is 
^andomi^ed, to randomize a different loop in each library. After panning of these libraries 
separately against the target, the selected clones are then xecombined. From each panned 
library only the randomized segment is amplified by PGR and multiple randomized segments 
are then combined into a single domain, creating a shuffled library which is panned and/or 
screened for increased potency. This process can also be used to shuffle a small number of 
clones of known sequence. 

[337j Any common sequence may be used as cross-over pomts. For A 
domains or other cysteine-containing monomers, the cysteine residues are logical places for 
the crossover. However, there are otiier ways to detemme optimal crossover sites, such as 
computer modehng. Alternatively, residues with highest entropy, or the least number of 
intramolecular contacts, may also be good sites for crossovers, 

|338] An exemplary method of generating libraries comprised of proteins 
with randomized inter-cysteine ioops is presented below. In this example, in conti'ast to the 
separate loop, separate library approach described above, multiple intercysteine loops are 
randomized simultaneously in the same library. 
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[339J An A domain NNK library encoding a protein domain of 39-45 amino 
acids haying the following pattern was constructed: 

Cl"X{4,6)-El-F"Rl-C2"A-^X<2,4)'-Gl"R2-C3J^P-Sl"S2"W"V-C4^Dl-G2^E2-» 
G3"D4-G4-S3"D5-E3-X(4,6)"C6; 

5 where, 

ChC6: cysteines; 

X(ti): sequence of n amino acids with any residue at each position; 

E1-E3: gjutamine; 

F; phenylalanine; 
10 RM12: arginine; 

A: alanine; 

G1-G4: glycine; 

I: isoleucine; 

P: proline; 
15 Si-SB; serine; 

W: tryptophan; 

V: valine; 

D1-D5: aspartic acid; and 

CI -03, C2^C5 & C4-C6 fom disulfides, 

20 f 340] The library was constinicted by creating a library of DNA sequences, 

containing tyrosine codons (TAT) or variable non-conserved codons (NNK)^ by assembly 
FCR as described in Stemmer et al. Gene 164:49-53 (1995), Compared to the native A- 
domain scaffold and the desiga that was used to construct library Ai (described previously) 
this approach: I) keeps more of the existing residues in place instead of randomizing these 

25 potentially critical residues, and 2) inserts a string of amino acids of variable length of all 20 
amino acids (NNK codon), such that the average number of inter-cysteine residues is 
extended beyond that of the natural A domain or the Al library. The rate of tyrosine residues 
was increased by including tyrosine codons in the oligonucleotides^ because tyrosines were 
found to be overrepresented in antibody binding sites^ presumably because of the large 

30 number of different contacts that tyrosine can make. The oligonucleotides used in this PGR 



reaction are: 



1. 5 

2, 5 

35 3^ 

4, 5 

5, 5 

6, 5 

7, 5 
40 8. 5 

9. 5 

10. 5 

11. 5 

12. 5 
45 13. 5 



-ATATCCCGGGTCTGGAGGCGTCTGGTGGTTCGTGTNNKNNKNNKWNKGAATTCCGA- 3 ' 
-ATMCCCGGGTCTGGAGGCGTCTGGTGGTTCGTGTNNKNNKmKNWraNKGAATTCCGA" 3 ' 
- AT ATCCCGGGT CTGGAGGCGTCTGGTGGT TCGTGTl^l^K^Nlt^ KlS^it^KNNKKINKMN KG AATTCCGA- 

-ATATCCCGGGTCTGGAGGCGTCTGGTGGTTCGTGTTATNNK^^MKNNKGAATTCCGA- 3' 
-ATATCCCGGGTCTGGAGGCGf CtGGTGGTTCGTGTiE?^NKTATJ:^NKWNKNNKGAATTCCGA" 3 ' 
"ATATCCCGGGTCTGGAGGCGTCTGGTGGTTCGTGTi^HKTATNNKNNKGAATTCCGA- 3 ' 
-ATATCCCGGGTCTGGAGGCGTCTGGTGGTTCGTGTNNKlSIiSIKTATl^NKGAATTCCGA" 3 ' 
"-ATATCCCGGGTCTGGAGGCGTCTGGTGGTa?CGTGTNlSIPa>!NKNt^KTATGAATTCCGA- 3 ' 
-ATATCCCGGGTCTGGAGGCGTCTGGTGGTTCGTGTt^iSiKt^NKNlSiKTATNNKGAATTCCGA- 3 ' 
-ATACCCAAGAAGACGGTATACATCGTCCMNlJ^MiNil^^TGCACATCGGAATTC'" 3 ' 
-ATACCCAAGAAGACGGTATACATCGTCCMni^MtvlNMNlSfTGCACATCGGAATTC- 3 ' 
"ATACCCAAGAAGACGGTAT ACATCGTCCH^^ll:^MNWMI^IHM^;|tsITGCACATCGGAATTC" 3 ' 
~ATACCCAAGAAGACGGTATACATCGTCCATAM1^NM!:^NTGCACATCGGAATTC- 3 ' 
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14 5' -ATACCraAGM^GACGGTATaCATCGTCCMNNATAMi^NMMWTGGACATCGGAATTC-- 3' 
1^' 5' -ATACCCAAGAAGACGGTATACATCGTCCMNKATAMNHTGCACATCGGAATTC- 3 
16 5' -ATACCCAAGAAGACGGTATRCATCGTCCMflNMNNaTATGCACATCGGAATTC- 3 
17' 5' -MACCCAAGAAGACGGTATACATCGTCCMNNMNNATAMNNTGCACATCGGAATTC- 3 
S 18 5' -ACCGTCTTCTTGGGTATGTGACGGGGAGGACGATTGTGGTGACGGATCTGACGAG" 3' 
I' -MATGGCCCCAGAGGCCTGCAATGATCCACCGCCCCCACAMKNMNNMNNMNNCTCGTCAG 

2j';'^'^'^;,^lATAT6GCCCCAGAGGCCTGCaaTGATCCACCGCCCCCACAMl^NMNNHMSlHNNMN^ 
10 If.^'^'^l' ~ATATGGCCCCAGAGGCCTGCAATGATCCACCGCCCCCACAMWNMWNMNKM»NMMC 
22':'^'^ffSSGGCCCCAGAGGCCTGCAATGAfCCACCGCCCCCACAATAMl5NMNNM«WCTCGTC 

2^^^^5r4mTGGCCCCAGAGGCCTGCAATGATCCACCGCCCCCACAMNmTAMNNMjqNMflKCT 
15 CGTCAGATCCGT-_^3^^^^^^^^^^^^^^^_^^^^^^^^^^ 

3^^''^5?^:?VTlTGGCCCCAGAGGCCTGCAATGATCCACCGCCCCCACAiiyNMMAMNNCTCG 

20 2™V^^ATMGGCCCCAGAGGCCTGCAATGATCCACCGCCCCCACAM^^N^SNNMNmTACTCG 
TCAGATGCGT- 3' 

SATGGCCCO^GAGGCCTGCAATGATCCACCGCCCCCACAMNmNNMNNAM 

Where R-A/G, Y-C/T, M=A/C, K=G/T, S=C/G, W^^A^T, B=C/G/T, D=A/G/T, H==A/cyT, 
25 V=A/C/G,andN=A/C/GyT ^ , „ 

|341] The library was constmcted though an initial romd of 10 cycles ot 

PCR amplification using a mixture of 4 pools of oligonucleotides, each pool coBtairang 
400pinols of DNA. Pool 1 contaijied oligonucleotides 1-9, pool 2 contained 10-17, pool 3 
contained only 1 8 and pool 4 contained 19-27.. The fully assembled library was obtained 
30 through an additional 8 cycles of PCR using pool 1 and 4. The Ubrary ftagments were 
digested witti Xmal and SfiL The DNA fragments were Ugated into the corresponding 
restriction sites of phage display vector fose5-HA, a derivative of fose5 carrying an in-frame 
HA-epitope. The Hgation mixture was electroporated into TransforMax-r^* gCl 00^^ 
dectrocompeteit^. co/^ cells resulting in a library of 2X1 0' individual clones. Transformed 
35 E. coll cells were grown overnight at 37°C in 2xYT medium containing 20 fxg/ml 

tetracycline. Phage particles were purified from the culture medium by PEG-precipitation 
and a titer of 1.1X1 0"/ral was determined. Sequences of 24 clones were detemiined and 
were consistent with the expectations of the library design. 

40 Example 4 

[342] This example describes optimization of multimers by optimizing 
monomers and/or linlcers for binding to a target. 

[3431 One approach for optimizing multimer binding to targets involves 
optimization of monomers, multimers and linkers. First a library of monomers is panned for 
45 binding to the target (e.g., Met). However, some ofthe monomers maybind at locations on 



81 



wo 2006/009888 PCT/US2005/021558 

the target that are far away from each other, such that the domains that bind to these sites 
cannot be connected by a linlcer peptide. It is therefore usefol to create and screen a large 
library of homo- or heterotrimers from these monomers before optimization of the 
monomers. These trimer libraries can be sca:eened, e,g., on phage (typical for heterotrimers 
5 created from a large pool of monomers) or made and assayed separately (e.g., for 

homottimers). By this method, the best trimer is identified. The assays may include binding 
assays to a target or agonist or antagonist potency determination of the multimer in fimctional 

protein- or cell-based assays. 

(344] The monomeric domain(s) of the single best trimei- are then optimized 
10 as a second step. Homomultraiere are easiest to optimize, since only one domain sequence 
exists, though heteromultimers may also be synthesized. For homomultimers, an increase in 
binding by the multimer compared to the monomer is an avidity effect 

[345] After optimization of the domain sequence itself (e.g., by recombining 
or NNIC randomization) and phage panning, the improved monomers are 
1 5 used to construct a dimer with a linker library. Linker libraries may be fornied, e.g., from 
linkers with an NNK composition and/or variable sequence length, 

(346] After panning of this linker library, the best clones (e.g., determined by 
potency in the inhibition or other functional assay) are converted into multimers composed of 
multiple (e.g., two, three, four, five, six, seven, eight, etc.) sequence-optimized domains and 
20 length- and sequence-optimized Hnkers. 

Example 5 

(347] This example describes a structural analj^is of A domains. 

[348] As with virtiially all proteins, only a small fraction of the total surface 
25 of an A-domain participates in binding a single target. Based on the solution structure of the 
domain, adjacent residue positions can be identified which are likely to be able to cooperate 
in binding to a given target. Herein, such groups of adjacent residues are referred to as 
stmctoral categories. As an example, four such categories have been identified tiirough 
examination of the A-domain structure, designated Top, Bottom, Loop 1, and Loop 2. By 
30 designing libraries which only allow diversity within a given category, the theoretical 
sequence space aUowed by a library can be significantly reduced, allowing for better 
coverage of the theoretical space by the physical library. Further, in the case of non- 
overlapping categories such as the Top and Bottom categories, half-domain sequences 
selected against different targets can be combined into a single sequence which would be able 
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to bind staltaaeously ot alternatively to the selected targets. In either case, creating binding 
sites that occupy only half a domain allows for the creation of molecules tiiat are half as large 
and would have half the number of immunogemc epitopes, reducing the risk of 
immunogemcity. 

Structural Classification of A-domam Positions 

A canonical A-domain sequence is shown below with high-diversity positions represented as 
aa X Positions that belong to either the Top, Bottom, Loop U or Loop 2 categories are 
designated with a star. 
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Example 6 

1 5 (3491 This exatnple describes screening for monomers or multimers that bind 

c-MET (also known as HGFR), 

[350] Phage libraries were panned through several rounds either on solid 

support (e.g. Nunc Maxisorp plates) or in solution (e,g. Dynal Strcptavidin or Protein A 

beads). Output phage pools with (a) the highest frequency of individual phage clones that 
20 bind to c-MET and (b) high sequence diversity among the binding-positive phage clones 

were chosen for protein screening. 

I. Round 1 {Maxisorp plates or Dynal Beads) 
25 1. Coating Target 

[3513 A- Coatimplatesx Six wells/library were directly coated with c-MET 
extracellular domain (ECD)/Fc chimera (0.5 |ag/weli) using 100 |LtL/well of 5 iiig/mL c^MET-- 
BCD diluted in TBS[pH 7.53/2 mM CaCla. When using c-MET-ECD/Fc fusion (R&D 
30 Systems; carrier free) as the target^ the plates were pre-coated with Protein A for 1 hr at room 



83 



wo 2006/009888 PCT/US2005/021558 
temperature wifli shaking. When using a biotinylated form of c-MET ECD/Fc^ plates were 
pre-coated with streptavidin for 1 hr at tooth temperature with shakmg. In addition, one 
negative control well/library was coated with TBS[pH 73]/2 mM CaCl2 only. After pre- 
coating was complete, c-MET-Fc biotin) was added and the plates were incubated for 1.5 
5 hr at room temperature with shaking. 

[352] B. Coatim beads: 20 Dynai streptavidui (M-280; Dynal ASA) or 
Dynal Protein A beads (Dynai ASA) were incubated with 5 \x% biotinylated c-MET/Fc or 
non-biotinylated c-^MET/Fc, respectively, in 500 ^iL TBS[pH 7,5]/2 mM CaCla and rotated at 
10 room temperature for 1 hr in Eppendorf tubes. As a negative control, 20 ^iL Dynal 

streptavidin or Protein A beads without target were incubated in 500 jiiL of TBS[pH 7.5]/2 
mM CaCi2 and rotated at room temp^ature for 1 hr. Note that Dynal beads were washed at 
least twice with TBS [pH 7.5J/2 mM CaCl2 before adding target, and beads were coated in 
bnlk. 

15 

2. Biocldng 

[353] A. Blockins Plates: Coating solution was removed and wells were 
washed one time with 200 fxL/well of TBS[pH 7,5]/2 mM CaCla. 250 fil/well of 1% BSA 
20 (protease-free) in TBS[pH 7,5]/2 mM CaCi2 was added md incubated for 1 hr. at room 

temperature with shaking. Alternative reagents (e.g. casein or milk) can be used for biocldng, 

[354] B. Blockim Beadsx Coating solution was removed and beads were 
washed twice with TBS [pH 7.5]/2 mM CaCl2. 500 fil 1% BSA (protease-free) was added in 
25 TBS[pH 7,53/2 mM CaCls and rotated for 1 hr at room temperature. As noted above, 
alternative biocldng reagents can be nsed. 

3. Washes 

30 [355] A, Wash Plates: Wells were washed three times with 200 |iL/well of 

TBS[pH 7.5]/2 mM CaCk to remove excess target. 

[356] B, Wash Beads: Beads were washed three times with 1000 \xL of 
TBS[pH 7.5]/2 mM CaCl2 to remove excess target. Beads were allowed to collect on a 
35 magnet for a few min after each wash to avoid bead loss. 
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4* Phage addition 

1357] A, Phase addition to Plates: About 1000 library equivalents (Al 
domain naive phage library) were added in phage addition buffer (1% nonfat dry milk/ 0.2% 
5 BSA (protease-free), or other appropriate blocking agent, in TBS [pH 7,5]/2 wM CaCIa) and 
incubated at room temperature for 2 hr with shaking. In rounds 2-3, 1 00 p-L total of 
harvested phage was added to 7 wells (6 target + 1 negative control) diluted in phage addition 
buifer. 

1 0 [3581 B. Pha^e addition to Beadsx About 1000 library equivalents (Al 

domain naive phage library) were added in 500 p.1 1 % non-fat dry milk 00 ^1 1 % BSA 
(protease-free) in TBS [pH 7.5J/2 mM CaCl2 and incubated with rotation at room temperature 
for 2 hr. In rounds 2-3, 100 pL total of harvested phage were added to beads. 

15 5* Washes 

1359] A. Wa^him Plates: Tlie plates were washed eight to twelve times 
with 200 lal/weil of TBS [pH 7.5]/2 mM CaCfe/Ol % Tween-20 over a period of 10 min. 

20 13601 B. Washim Beads: The beads were washed 8-12 times with 800 ^1 of 

TBS [pH 7,53/2 mM CaCyOJ % Tween-20 over a period of 30 - 45 min. Bead resuspension 
was facilitated by dispensing wash buffer directly onto collected beads or by pipetting up and 
down (not by vortexing). Alternatively, a KingFisher apparatus (Thermo LabSystems) or 
equivalent can be used for bead washing, 

25 

Conditions for Stringent washes (options) 

[361] a, 800 III of TBS [pH 7.5]/2 mM CaCVO.l % Tween^20 at 37^C; 

30 [362] b. 800 ^il of TBS [450 mMNaCl, pH 7.5]/2 mM CaCl?/0,l % 

Tween-20 at room temperature; 

[363] c. Beads were washed normally 6-8 times, then 1 ]ig of unlabeled 
c-MET-ECD was added for 1 hr at room temperature or 37^C. Phage that remained bound 
35 after this wash were retained for elution/infection; 
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[364] d. 

stringency). 



PCT/US2005/021558 
1% imlk/0,2% BS A/with or without 1 M x^rea/ (high 



6. Competition (optioual): 

[365] A* Competition on Plates: Phage were incubated with 100 [aL/well of 
50 fig/mL (5 )ig/well) of HGF (the c-MET ligand)) in TBS [pH 7.5]/ 2mM Caa2 for 1 hr at 
room temperature with shaking. HGF eluates were retained for infection of BlueKan K91 E. 
coli. 

(366) B, Competition on Beads: Phage were incnbated with 10 HGF in 
500 ^iL TBS [pH 7.5]/ 2mM CaCl^ for 1 hr at room temperature with shaking, HGF elnates 
were retained for infection of BiueKan K91 E. coli, 

7. Phage ElutioH 

|367] A. Elution off of Plates: 1 00 fxL/well of 10 mg/mL trypsin in 

TBS [pH 7.5]/ 2mM CaCk was added, and the plates were incnbated at 37*^0 for 30 min with 
shaking, 

[368} B. BlHtion off of Beads: 100 pL lOmg/ml trypsin TBS [pH 7.5]/ 
2mM CaCl2 was added to beads^ wliich were then incubated at 3TC (in an Eppendorf rack) 
for 30 min with shaking. 

[369) C. AUemative elution/infection: 200 [xL of log-phase BlneKan 
K91 E. coli cells at ODeoo ~ 0.5 were added to each well (for plates) or to aspirated beads. 
The infection was allowed to proceed for 30 min at 37^C without shaking. Next, the 200 jiL 
volumes were pooled and added to - 3 mL of 2xYT/0.2 \x^mL tetracycline and shaken for 15 
min at 37^C. 

8* Infection: (same for Plate and Bead Protocol) 

[370J An appropriate volume of log-phase BiueKan K91 E. coli (in 2xYT/40 
|iig/mL kanamycin) was gi^own to OD^oo ~ 0.5 - 0.6. When the culture reached OD^oOs it was 
placed on ice prior to use, although the time on ice was generally minimized. 

[371] A. In a 50 mL sterile conical tube, eluted phage were mixed with 5 
mL log-phase BiueKan K91 E, coli culture and incubated at 37^C for 25 min without 
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shaking. The sterile conical tubes were covered with AirPore tape (Qiagen) to facilitate 
aeration, 

[372] B, Tetracycline was added to a final coucenHation of 0.2 jag/mL 
5 and shaken for 1 5 min at ST^'C 

[373J C. A 1 0 i^L aliquot was sampled for titeriag and serially dilvited 
10-fold (10^' to 10^) in 2xYT, plated in 8 [iL/dilution spots on 2xYT/20 jig/ml tetracycline 
plates and incubated overnight at 30^C or 37^C, The remaining volume of the 1 0'^ - 10"^ 
1 0 dilutions was plated to obtain single colonies for subsequent phage ELISAs. 

[374] Infected 5 mL cultures were diluted -10-fold into 50 mL 2xYT/ 

20 [ig/mL tetracycline and incubated with shaking at 30^C overnight to saturation. 

I g 9^ Titeting input phage was used in the current round of panniag (same for Plate 

and Bead Protocol) 

[3751 A, 100-fold serial dilutions (lO^^ to 10"^^) of harvested phage were 

made in 2xYT. 

20 

[376] B. 1 00 jiL/well of a log-phase BlueKan K91 £ coU culture at 
OD^oo 0,5-0.6 was added to 6 wells of a 96-weIl polypropylene plate. 

[377] C. 10 of diluted phage was added to the wells containing 100 
25 |LiL of BlueKan K91 E. colL 

{378] D. Phage/cell mixtures were incubated at 37^C for 25 min without 
shaking, and the plates were covered with AirPore tape (Qiagen) to allow for aeration, 

3Q [379J Tetracycline was added to a final concentration of 0.2 ^g/mL 

and the plate was shalcen for 15 min at 3TC. 

[380] 8 \xh of each dilution (10"^ to 10"^^ was plated onto a dry 2xYT 

agar/ 20 \xg/mL tetracycline plate, 

35 

[381] a Plates were incubated at SO^'C or aV'^C overnight. 
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10* Harvesting phage (same for Plat<^ and Bead Protocols) 

[382) A. Overnight cultures were centrifuged at 7000 rpm in disposable 
50 mL tubes for 25 min to pellet cells, 

5 

[383] B. A standard PEG/NaCl phage-precipitation procedure was 
performed by adding 1/5 volume of a 20% PEG/1 5% NaCl stock to culture supernatant. It 
was mixed well by repeatedly inverting and incubating on ice for 45 min to 1 hr. 

10 [384] C, The culture was centrifuged at 7000 rpm for 40 min to pellet 

phage and the supernatant was discarded, 

[385] D. The phage pellet was resuspended in 1 mL TBS [pH 7.5]/ 2 
mM CaCla, transferred to an Eppendorf tube and centrifuged at 13K rpm for at least 2 min to 
1 5 pellet insoluble material . 

[386] E. Supernatant wag transferred to a fresh tube and 1/5 volume of 
PEG/NaCl was added, mixed and incubated on ice for - 5 min. 

20 [387] F. The mixture was then centrifuged at 1 3000 rpm for at least 2 

min, and the supernatant was removed. The pelleted, purified phage were resuspended in up 
to 1 mL TBS [pH 7.5]/ 2mM CaCk and stored at 4°C. 

11. Round 2 and round 3 panning 

25 

[388] The 2"^ and 3'^ round panning conditions were generally the same as in 
Round 1 described above, except the coated target (i,e. c-MET-^ECD) amount was decreased 
2^ to 4-fold for each subsequent round, and the plates (or beads) were washed 2 ~ 4 additional 
times in each subsequent round of panning* 

30 

IIL Optional iutra-doinain recombination 

[389] Monomer sequences in phage display-selected phage pools were 
recombined in the following procedure. This process generated hybrid monomers derived 
35 from mixed halves of the starting monomer collection in a given pool(s). For Al -domain- 
based phage hbraries, the primer pairs SHFl (ATTATGCCCCGGGTCTGGAGGCGTC)/ 
SHBoveriap (CGCCGTCGCAA) and SHFoverlap (TTGCGACGGCG)/B3 
(TCGGCCCCAGAGGCCTGCAATG) were used to PCR-amplify the two halves of the 
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monomers. The 2 halves were ftised together with LA Taq polymerase (Takara), Next, the 
fused hybrid coding sequences were amplified by primers SHF2 
(CCGGATTATGCCCCGGGTCTGGA) and SHB4 

(AAC AGTTTCGGCCCCAGAGGCCTGC). Purified PGR products were digested by Sfil 
5 (NBB) and ligated wift the Sfil-digested fUSESHA phage vector to generate recombined 
monomer libraries. Recombined libraries were panned at least two more rounds against c- 
MET ECD/Fc and screened as described below. Data fi*om characterization of recombined 
monomers is in Tables 1 and 2. 

10 IV. Analysis of panning output (same for plate and bead protocols) 

1390] Phase BLISAs: For each output "phage pool" to be analyzed 
(typically Rounds 2, 3 and 4 ^ if applicable), independent clones were inoculated into 1 mL 
(2xYT/ 20 (ig/mL tetracycline) cultures grown in Costar 96-weIl polypropylene deep-well 
1 5 plates. Inoculating tips were left in, and plates were shaken overnight at 37^C. Cells were 
pelleted by centrifiigation at 3600 rpm for 15 min. Culture supematants were retained and 
ELIS As were performed as described below, 

{391] Non-biotmylated c-MET ECD/Fc (0.1 ^g/well) was directly coated 
20 onto Nunc Maxisorp plates. However^ biotinylated c-MET ECD/Fc, 96-well Nunc Maxisoxp 
plates should first be coated with 50 fiL/well of 50 ^ig/mL (2.5 ^g/well) of streptavidin, 
diluted in TBS [pH 7.5]/ 2 mM CaClz. The plate was incubated at 37^C for 1 lir wifli 
shaking. Plates were washed three times with 200 jLiL/well of TBS [pH 7.5]/ 2 mM CaCl^. 
Wells were blocked with 200 jLiL/well of 1% BSA (fraction V) and tlie covered plate was 
25 incubated at RT for 1 hi* with shaking. The plate was washed three times with TBS [pH 

7.5]/2 mM CaCla. Next, the 96-weU Maxisorp plate was coated with 100 |iL/weli of 1 ^ig/mL 
(0.1 ^g/well) biotinylated c-MET-ECD diluted in TBS [pH 7.5]/2 mM CaCl2 or 100 ^iL/well 
buffer only (negative control). The plate was incubated at RT for 1 hr with shaking. Plates 
were washed three times with TBS [pH 7.5]/2 mM CaClz. Next, 30 p-L of each phage 
30 supernatant is added to wells in the presence of 70 |aL of 1% MilkyO.2% BSA/[pH 7.5]/2 mM 
CaCl2/0.02% Tween-20, Covered plates were incubated at RT for 1 .5 hr with shaking. 

[392] Plates were washed four times with TBS [pH 7.53/2 mM CaCl2/0.02% 
Tween-20. Next, 100 p-L/well of a-MlS-HRP monoclonal antibody (Amersham Pharmacia), 
35 diluted 1 :5000 in TBS [pH 7.5J/2 mM CaCl2 + 0.02% Tween-20, was added. Plates were 
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incubated at 4°C for 1 hr with shaking. fMes were washed three times with cold TBS (pH 
7.5J/2 mM CaCl2/0.02% Tween-20, 100 ^L/wdl of IMB/H2O2 mixture (Pierce), diluted 1:1, 
was added for ELISA development. 

[3931 The reactions were allowed to tiim blue until the strongest ODeso 
5 signals reached ~ 1 .0. The reaction was stopped with 100 jiL/well 2N H2SO4, and positive 
wells changed in color from blue to yellow. Once the reaction was stopped, it was read at 
OD450 on an ELISA plate reader using SoflMaxPro software. 

[394] Phage ELISA-positive phage pools were chosen for subcloning into an 
expression vector if they had (a) a high fiequency of individual phage dones that bound to c~ 
10 Met ECD/Fc and (b) high sequence diversity among the binding-positive phage clones. Pools 
meeting these criteria were chosen for protein screening in the process outlined below. To 
subclone tfc.e monomer or multimer sequences from a ^ven phage pool into the expression 
vector, pEve, approxunately 10^-1 0" phage were amplified by 25 cycles of PGR as foUows: 

PCRredpe 
15 0.5 - 1 JiL purified phage 
5 lOX Buffer 
8 jiL2.5mMdNTPs 

5 1 OuM VS-FOT primer (5'-ATCATCTGGCCGGTCCGGCCTACCCGTATCATGTTCCaQA-3') 

5 ^iL lOuM EveNut primer (5'-aaaagqccccagaggccttctgcaatgac-3') 

20 26HLH2O 

0.5 fxL LA Taq polymerase (1 unit) (Takara) 

Cydes: 25 X [94*'C/10sec.~45^C/30sec.-72V/30sec.] 

25 [395] PGR products were run on a 3% agarose gel for analysis. The 

monomer or multimer product (approxmiately 200 bp) was purified with a QIAqmck spin 
column (Qiagen), digested with Sfi I (NEB), purified agsin with a QiAquick column and then 
ligated using T4 0NA Ligase (NEB) to the Sfi I digested vector, pEve. The ligation was 
transfonned into dectrocompetent BL21 (DE3) E. coli and plated onto 2xYT plates 

30 containing kanamycin at 40 ^g /mL. Following overnight growth, approximatdy 6000 

individual clones were inoculated into 2xYT/kanamydn and grown overnight. Positive and 
negative controls were also included on the plates. 

V. Screening of thousands of moiiomer proteins in 1 mL cell lysates 
35 [396] Protein Produ ction of 1 mL heated Ivsates (Day 1): Individual clones 

were inoculated into wells of a 96-well Costar deep-well plate containing 400 nL/weli of 
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2xYT/ 40 \ig ImL kanamycin. Cultures were grown ovmught (which left inocrfatiiig tips in 
wells) while shaken at 300 rpm at 37^C. This process allowed screening of thousands of 
individual, partially-pnrified monomers at the cell-lysate leveL 

[397J (Day 2) 100 |JiL of overnight cxiltnre was inoculated into new 96- well 
Costar deep-well plate containing 1 mL/well of 2xYT/ 40 \ig/mh Icanamycin 4- 1 mM CaCk. 
(The remaining overnight culture was archived by the addition of 25% final glycerol 
concentration and then stored at -SO'^C for later use.) Plates were covered with AirPore Tape 
(Qiagen) and cultures were grown with shaking at 375 rpm at 37**C until an OD^oo of-- 0.8 to 
1 .0 was reached. Once the desired ODm was reached, cultures were induced with 1 noM 
IPTG for 3 hr while shaking at 375 rpm at 37''C. Plates containing induced cultures were 
then centriftiged for 15 min at 3600 rpm at 4''C to pellet cells. Supernatant was removed and 
discarded^ and the remaining cell pellet was resuspended in 100 of TBS [pH 7.5]/l mM 
CaCl^. Resnspended cells were transfeired from the 96-well deep-well plate to a 96-well 
polypropylene PCR plate and heated for 5 min at SS'^C in a PGR machine. Heated/lysed cells 
were then centrifuged for 15 min at 3600 rpm at 4''C. After centrifugation, protein production 
was complete, and heated lysates were ready for characterization in a primary screen via 
binding ELISA and/or competition AlphaScreen assays. 

[398] C-Met ECD/Fc Protein ELISA : 96-well Maxisorp plates were coated 
with 100 |iiL/well of 1 jxg /mL (0. 1 ^ig/well) c^MET ECD/Fc (R&D systems) diluted in 
TBS[pH 7.5]/ 1 mM CaCfe, and the plate was incubated at 4°C overnight or room 
temperature (RT) for 1 .5 hr with shaking. Wells were emptied and then blocked with 200 
laL/well of 1% BSA (j&action V)/TBS[pH7.5]/l mM CaCl2. The covered plate was incubated 
at RT for 1 hr with shaking. The plate was washed ttoee times with TBS [pH 7.5]/l mM 
CaCla. 100 ^L/well of monomer protein was added to the plate diluted in TBS [pH 7.5]/l 
mM CaCy 0.1% BSA/ 0,02% Tween-20, Protein from 1 mL heated lysate preparations was 
added to the wells as a single point concentration diluted 1:10. Covered plates were 
incubated at RT for 1 .5 hr with shaking. The plate was washed three times with TBS [pH 
7.5]/l mM CaCy 0.02% Tween-20. 1 00 pL /well of anti-HA-HRP detection antibody 
(Roche) diluted 1 :2000 in TBS [pH 7,5]/l mM CaCy 0.1% BSA/ 0,02% Tween-20 was 
added. Covered plates were incubated at RT for 1 hr with shaking. The plate was washed 
three times with TBS [pH 7,5]/ 1 mM CaCy 0.02% Tween-20. 100 jaL /well of TMB/ 
H2SO4 mixture diluted 1:1 was added. Color was allowed to turn blue until OD&50 of the 
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Strongest signals reached ~ LO. The reaction was stopped with 100 jixL/well of 2N H2SO4. 
Once stopped, the plate was read on ELIS A plate reader at OD450. 



[399] AlphaScreen c-Met/Fc-hiotm Ykted fbn) HGF homogeneovts 
com petition assay : All assay components were diluted in AlphaScreen Buffer: 40 mM 
HEPES [pH 7 A] w/ NaOH.l mM CaClz, OJ % BSA (w/v), 0.05 % Tween-20, 100 mM 
NaCl. Thi-ee additions were made to a white, 384-wel], reduced-Yotame, Greiner microtiter 
assay plate with no incubation time in between additions. First, monomers or rtnlabeled 
recombinant humaa HGF (rhHGF) (as positive control) were added to the plate at 2 [iL/welL 
Monomers from 1 mL heated lysate preparations were added to the wells at a single 
concentration (either undiluted [i,e. 1 :4 final assay dilution] or up to a 1 :1 00 dilution [1 :400 
final assay dilution]). As a positive control, instead of monomer protein, 2 jaL/well of 
unlabeled rhHGF (PeproTech) was added to the plate as a twelve-point concentration curve 
starting with 400 nM (i.e. 100 nM final assay concentration) and then 1 :4 serial dilutions 
thereafter with the last point as buffer only. Secondly^ 4 \iVwdl of c-MET ECD/Fc at 0.6 
nM (i.e. 0.3 nM final assay concentration) was added to the plate. Note that the remainder of 
the assay was done in subdued or green-filtered hght as AlphaScreen beads are light 
sensitive. Thirdly, 2 jxL/well of a mixture of bn-HGF at 1 nM (i.e. 0.25 nM final assay 
concentration) and AlphaScreen Streptavidin "donor beads" and Protein A "acceptor beads" 
(PerMi-jEImer) both diluted to 40 [xg/mL (i.e., 10 [ig/mL final assay concentration) was added 
to the plate. The assay plate was then covered with topseai and spun down for - 30 sec at 
800 rpm. The plate was then incubated overnight m the dark at room temperature and read 
the next day on the Fusion Plate reader (PerkinElmer). 

VI* MuUixnerization aad recombination of phage display-selected monomers 

{400] Monomers that have been subcloned into pEve (pEve/monomer) were 
multimerized in the following manner. pEve/monomer plasmids (individually or in pools) 
were digested with either BsrDI or Bpml (NEB). The 1 kb BsrDI and -2,9 Bpml 
fragments were isolated from 1% agarose gels and purified with Qiagen QIAquick spin 
columns. Pools of each of the two fragments were ligated using T4 DNA ligase (NEB); 
subsequently, the ligation was purified with a Qiagen QIAquick spin column. Using the 
primers VS-For and EveNut described in the phage subcloning section above, the multimer 
coding sequences were PCR-amplified fi-om the ligation. The PGR products were purified 



92 



wo 2006/009888 PCT/US2005/021558 

and digested with Sfil (NEB), foUowed by ligation with pEve and transfonnation of BL21 
(DE3) E. coll This method created dimers coHJprised of different combinatiors of ttie 
starting monomers. This method can also be used to generate other multimears, such as 
trimers. When making trimers, pools of pEve/dimers (e.g. in above example) and 
pEve/monomers (the starting collection) are the starting materials. They are processed as 
above. A molecular biology procedure similar to that described below for making "walldng 
libraries" was also used to generate raultimexs. In all cases, proteins were expressed, purified 

and screened as above. 

[401] Additional libraries, referred to as "walldng libraries," were generated 
by ligating phage display-selected monomers (i.e. selected monomers) with the full 
rqjresentation of a naive monomer library. These libraries were constructed in the following 
manner. PCR was used to amplify in two separate reactions: a) the coding sequences of the 
selected monomers with pETF (ACCCGTATGATGTTCCGGATTA) /pETB2r 
(GATGTATTCGGCCCCAGA GGCCTGCAATGAC); andb) the coding sequences of naive 
monomers in a monomer library with 21newl (GAAATTCACCTCGAAAGCAA)/23 
(ATGGGTTCCTATTGGGCT), The -200 bp products were isolated from a 3% agarose gel 
and purified with Qiagen QIAquick spin columns. Each product from (a) and (b) above was 
digested with either BsrDI or Bpml (NEB) in separate reactions. Bpml-digested monomers 
have an overhang which can be ligated to BsrDI-digested monomers. The purified digestion 
products were ligated to one another using T4 DNA ligase (NEB). Ligation of BsrDI-cut 
naive monomers with Bpml-cut selected monomers generates a walking dimer library 
comprised of N-terminal naive monomers fijsed to C-terminal selected monomers. Ligation 
of Bpml-cut naive monomers with BsrDI-cut selected monomers generates a walking dimer 
library comprised of C-teroiinal naive monomers fused to N-terminal selected monomers. 
Primers pETF/pETB2r were used to PCR-atnplify the ligated dimer coding sequences ixom 
the ligation, and the purified products were digested with Sfil followed by Xmal. The 
digested products were Ugated to the phage vector fUSBStlA for the generation of a phage 
display dimer "walking library", typically with 10^40^ unique membei-s. A triraer (or larger 
multimer) "walking library" can be generated in a similar fashion, except that the starting 
materials are diraers (or larger) and naSve monomers. Walking libraries were panned against 
c-MET ECD/Fc and screened ^ described above. 

m Characterization of purified monomers in binding and competition assays 
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[402J Once proteins were characterized at the heated protein lysate level, the 
best monomers were chosen for ftirther characterization. Larger-scale cultures of individual 
clonal were prepared and the monomers, which bear a 6His tag, were pi3xified via Ni-NTA 
resin. These nickel-pmfied monomers were assayed in binding ELISAs and the 
5 Alphas creen competition assay. Protein sequence data and biochemical data from 
chai-acterization of purified monomers are in Tables 1 and 2. 

[403} Protein Purification SOnmT.Ciiltnres forNiNTA: (Day 1) hi a 15 mL 
culture tube containing 3 mL of 2xYT + 40 ^ig /mL kanamydn, the appropriate "primary hit 
1 0 well" archived glycerol stock was inoculated. Culture was shaken overnight at 300 rpm at 
3TC. 

[404] (Day 2) 2 mL of overnight culture was inoculated into 1 L Erlenmeyer 
shake flask containing 500 mL of 2xYT + 40 tig/mL kanamycin. Cultures were grown with 
shaking at 375 rpm at 3TC until an ODfioo of about 0.8-1 .0 was reached. Once desired ODgoo 
1 5 was reached, cultmes were induced with 1 mM final concentration of DPTG for 3 hr while 
shaking at 375 rpm. After 3 hr induction, the 500 mL culture was transferred to 
clean/autoclaved Sorvall tube and centrifuged for 8 min at 8000 rpm at 4°C to pellet cells. 

[405J Once cells were pelleted, supernatant was removed and discarded, and 
20 mL of sonication buffer (1 0% sucrose/20 mM Tris [pH 7.5]/l 50 mM NaCI/0.2 mM CaCk) 
20 was added to each tube. The pellet was resuspended in sonication buffer with 1 0 mL 

serological pipet until there were no visible clumps, and then the resuspended cells (~ 30 mL) 
were transferred into 35 mL Oakridge Tubes and sonicated for 8 min at ~ 1 6 power output. 
After sonication, the warm Oakridge Tubes containing sonicated cells were placed on 
ice/water bath for - 10 min to cool. Once cooled, tubes were centrifuged for 30 min at 
25 1 8,000 rpm at 4^0 to pellet lysed cells. 

[406J While tubes containing lysed cells were being centrifuged, NiNTA 
resin (Qiagen) was washed with Mlli-Q water to remove ethanol. 3 raL of 1 :1 diluted 
NiNTA r(^n/protein was used (i.e. actually 1.5 mL of xesin/protein was used). 3 mL of 
resin/water mix each was added to appropriately labeled (with protein ID) clean 50 mL screw 
30 c^ tube. After sonicated cells were pelleted, protein supernatant was removed and added to 
50 mL tube containing the 1 .5 mL of washed NiNTA resin. Protein was allowed bind to 
NiNTA resin by rocking gently for 0.5 hr. @ RT. After incubation with NiNTA resin, 
centrifuge 50 mL tubes with NiNTA were bound to protein for 10 min at ~1 500 rpm. 
Supernatant was gently poured out and discarded. 
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[407] NiNTA resin + bound protein was transferred to ^propriately labeled 
1 5 mL Clontech columns by adding 1 mL of NiNTA Wash Buffer (20 mM Tris [pH 7.5], 200 
mM NaQ, 0.1 mM CaCl^, 20 mM imidazole) to 50 mL tube containing resin, swirling to 
resuspend, then pipetting the mixture into a coluran which has been moimted on a vacuum 
manifold. NiNTA resin + bound protein was washed with at least 10 column volumes (15 
mL) of NiNTA wash buffer. 1 5 mL columns containing NiNTA resin + bound and washed 
protein was transferred to clean 1 5 mL screw cap collection tubes. 4 mL of Ni Elution buffer 
(20 mM Tris [pH 7.5], 200 raM NaCl, 0.1 mM CaQa, 200 mM imidazole) was added to each 
column to elute off protein into the 1 5 mL collection tube. It was then aUowed to elute by 
gravity. 

[408] Eluted protein was transferred to sUde-A-lyzex cassette (appropriate 
1^ cutoff—for monomers used 3.5 kDa cutoff and for dimeis and trhners iTsed 10 IdDa 
cutoff) using 18.5 gauge needle and 5 mL syringe to load cassette. Slide-A-lyzers containing 
eluted proteins were placed into overnight dialysis buffer containing redox reagents (20 mM 
Tris [pH 7.5], 100 mM NaCl, 1 raM CaCl2, 1 mM 2-mercaptoethanol, 0.25 mM 2- 
hydroxyethyldisulfide). 

(4091 (Day 3) Siide-A-lyzer cassettes containing overnight dialyzed proteins 
were transferred into dialysis buffer without redox (20 mM Tris [pH 7.5], 100 mM NaCl, 1 
mM CaCU), After 3 hr dialysis, slide-A-lyzer cassettes were transferred into fiesh TBS/ 
CaCfc without redox for another 3 hr. After 2"^ dialysis change, proteins were removed from 
slide-A-lyzer cassettes using 18.5 gauge needle and 5 mL syringe, and protein was 
trar^ferred by filtering using 0,2 micron syringe filter into appropriately labeled 15 raL 

polypropylene tube. 

[4W] The anti-c-MET NiNTA purified proteijK, which were selected as the 
"best inhibitors" in AlphaScreen competition assays, were further purified by Q-^epharose 
anion exchange to remove contaminants. Q-Sepharose Purification : 1 mL of Q-Sepharose 
Fast-Flow Resin (Amersham Biosciences) w^ added to 15 mL Clontech column. Resin with 
15 column volumes (or 15 mL) of 20 mMTris [pH 7.5], 50 mM NaCl, 1 mM CaCh was 
equilibrated. 2 mL (~ 5 mg) of filtered NiNTA-purified protein was added to resin and 
protein was allowed to bind to resin by gravity. How-throu^ into first column of 96-well 
plate was collected. Columns loaded with protein were transferred to 15 mL collection tube 
and resin/bound protein were washed with 1 0 column volumes (or 1 0 mL) of 20 mM Tris 
[pH 7.5], 50 mM NaCl, 1 raM CaCk. Once washed, NaCl gradient elution of protein was 
started. NaCl concentration was varied in gradient as follows: 1 00 mM, 150 mM, 200 mM, 
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250 mM, 300 mM, 350 mM, 400 mM, 500 wM, and finally 1 M NaQ, to a base of 20 mM 
Tris [pH 7.5], I mM C&Ch. Fractions were collected in 96~well deep-well polypropylene 
plate— 2 mL/fi-action, in 1 mL increments. Fractions containing protein were tested by 
Bradford and analyzed by SDS PAGE. Fractions were tested in binding ELISAs and 
competition assays as described above with the following change. Protein from 500 mL 
NiNTA purified preparations or NiNTA + Q-sepharose purified preparations was added to 
the plate as a twelve-point concentration curve starting with a 1 :5 to 1 :100 first dilution and 
then 1 :4 serial dilutions liiereafter with tifie last point as buffer only. Protein sequence data 
and biochemical data from characterization of purified monomers are in Tables 1 and 2. 

Table 1. Aati-c-MET Avimer (M) and Recombined (Rec) Monomer Sequences. Note 
that some proteins isolated from the recombined monomer library are dimers. 
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Protein 



ID 



Construct 



ML 



Ci29 



cm 



CF5PEPKCHSTGIlClPVBmCDGVNPCEPaSPEASCSAPASB?rGSL$r^ 



CEAMEFRCtCSTaRCiSOTWRCDGDDPCEaSSDEA^iCKPPTSLQ 



cm 



C132 



CLSt^FRCS$T'GRCLPIU'V/VCPGD!C>CE&GSDEA^AlCGRFGPGATSAPAASUL 
CMTTQFSCRSTHRCiPLPV/QCDOVTSCEDNSDBASCSAPASafPGSLSLQ 



M06 



CI33 



CPFDEFrCRSrrBRaPLAWVCDGD?^QCEDSSI>EACCTFFErrSLQ 



CB4 



MPS 
M09 



Sequence 



C13$ 



CLASEFTC}^SrrORC^PETWVCPGVMDCEDSSP£APCORP0PGA-rSAPAASLg„ 



MIO 

M3i 



CQPDEFTCNST6RCJ?PPWVCDGVI>PCEDC?6DETGCSQP?BFHKVSLQ 



CLASEFrCHSTGRCtPE-nVVCDGY»DCBDS$t?El>NAHCGRPG?GATSAPAASlQ 



Mt2 



6VFN£rfcSSfORCtPKA\VyCDGVPPCEP 



C241 . 



^BFPCRSTGRCVPLTWLCI>GPNI>CEDGSDBASAtCORPGPOAT^^ 



M16 



CAl'SEFTCNSTOllClPQEWyCDGPNPCBDSSDEAP&LCASAAPTSLQ _ 
CllAT^E!FQCB$TGR€JPg^W^CDGDNPCBPGS&EACCAASGPTSLQ.. 



cesnefocqstsrcipltwrcpovndcepgspea-nctaavhtslql 



M20 



CBSSEFRCRSTGHaFGGWLCPGPNPCBPSSDETPCSAPASBPPGSLSLQ 



cm 



CE APEFRCR5TCRC1SVPWRCDGVSPCEPSSP££SC£$TAPTSLQ_ 
CVSNErrCRSTKKCVPQBWVCPOVNPCEDGSDBTCCFKBTSLgZ 



M23 



CPSGEFQCRST^aaPErWLCDGPNDCBDGSPEBSCTPPTStQ 

CAA>3EFOCHSTGRCJPLSV/VC0GVNDCEP5$PETNCRAFTSBPKG$VgLQ_ 



U2A . 
M25_ 



CPPbEFRCTSTGRCiPJlAWLCHGPNPCj-PSSDBKPCKPHlSLQ 



C3S6 



CISGEFQCHSTGKCIPASWLCI>GDr^DCEDGSDESOLCTAHT$tQ 



c6sFTE fecHStGKClPAMgP^™P^^^^^P 
PVASBFTClsfGRCiPESWRC^^ 



M23- 

m9 



C3S9 



CG$SBFOCHSTGHC3PENWVCDGPDPCBDSSDEKSCTSAAPT5Lt^ 



CQAGQFECRSrHKClPO£>WVCPGVMPCEDSSDEESCTSPARTSLQ 



COAGQFECIQ^i'mRCiPQPWVCPOVMPCEDSSDEESCTSPARTSLQ 



CU^SEPQCKST^RClPOAWLYPGV^iPCEPGSPBTNCSAPASEFPGSlSLQ.. 



CQPI^EFQCHSTGRClPASWLCPGPMPCEPGSDBSQLCTAHTaLQ 



M32 



C394 
C3?5 



^APBFQCr^grGRCIPVSWVCpGVMPCBPSSPBAGCATSGPTSLQ.. 



CPSSQn-CHSTRRClPQTWVCbGiDN^ 

CLA^^EFRCN£TGRCIr'RAWLCDGP^lDCEPSSPEKPCKQ}^TSLQ■ 



M35 
M37 



CSSPBFQds^SRClPREV/VCPGVMPCEgOSDEALArCTSTAHTSLQ 



C39S 



CBSKEFQCBSTSRC1PITWRCDRVMDCBPG5 DEA>^CTAAVHISLQ 



M3S 
M40 



C40i . 



C GANBFICQSTNRCtFQSWVCPGVI^&CEPGSPESPVLCATTVHTStQ 
CVSNEFTCRSTKKCVPOEWVCPGVtiPCEBGSDBTGCPKHTSLQ 



CTS b3EFPCOSTDRC!PRSWKCPOP^iPCEDG^PBKPCSAPASEP?qS[_^L5L_ 
LFSEFQqCSTMRCIFQAWlYPGTODCEPGS^^ 



RccMOt 



CPAOQS^CRSTORCJPLQWVCDGPNPCBPSSDESPAlCATTGPTSLQKASAAyPYTDVPDYAPGLEASGGS 
CBgNBFQCRSTORCVPVAWVCPgDMPCEPSSPBKNClWTSLQ 



C410 



CBSNErOCQSTSRClPLQWVCDODNPCEPSSPEASCGCPOFGAT SAFAASLQ 



{(Itlliier) 



CHAFrQFBCKSTGRCIPLTWCPGPNPCBPGSPHKPCGPSHJLPFSTFGPSTSLQKASAAYPYDVFPYAPGLEASG 
CDPJAEFK CHSTGRCIPLPmCPGVNPCBPSSDHSPAHCSAPASBPPGSLSLQ 



RccMOS. 



C413 



CHPIABFECHStGRClPVDWLCT 



RccMO& 



C414 



CQASPQFECKSTGRCIPLAWCDGDNDCEDGSPBSPATCORPGLEASOGS 
Cl^EFQCHST0RCIPASWLCPGP>iDCBDGgPEASCGia>.0FGGT5APAAg^^ 



RccM&g 



CAAPEFQCNSTGRCIPVNWLCPGPNDCEPSSPEENCSAPASEPFCSLSLQ 



_C416 
C4I7 



CQSFrEFECHSTGRCIFVDWLCPODNDCEPSSPESPAtCSAFAS&PFGSLS!^ 



CBSNEFQCRSTGRClPVSWVCpqP^DCEPSSDEASCGPSHILFPGTPGPSTSLQ 



RccMlO 



cm 



CRAI^BFQCHSTGRCIPASWLCPGDNDCBDSSDBAPPLCASAAPTSLQASGLEASGCS 
CHAPT0FBCRSTGRC]PAAWVCDGP^IDCEPGSPESFAICGRPGLGATSAPAAgL5 



RccMH 



C419 



CLAMBFTCRSTGRCJPLQWVCPGPNPCEPSSPBKGCGPSHILPGLEASGGS 
CFASQ£'PCRSTORCIFA£WVCP0PM>CEPSSPBASaGPSHlLPrSTFGFSTSL<3L^ 



Rct:Mi2 



C420 



CESKEFQCQSTSRCIPLTWRCDOPNDCEPSSDBKSCSAl^ASEPPGLEASGGS 
| CPASEFTCmGRCISQGWVCPGPNDCEPSSPESFArCATTGPTSLQ 



^dLrt C42) CASSBFRCRBTGRCJPQRV/VCPODNPCEDGSPE'mCGPSmLPFSTPGLEASGGS 
' IcO TGBFRCRSTPRCIFAEWVCPGPSPCBPGSPBTN CGPSHlLFFSTPGFSTSLQ 
C4r5 IcEPPB^QCRSTGRCIPlEWLCDGDNbC^^^ 



RccMU 



Table 2. AntU-MET MoBomer and Recombined Monomer Binding Kd and 
5 Biochemical IC50 data. Note that some proteins isolated from the recombined monomer 
library are dimers (noted in Table 1), Blank entries indicate data not available. 
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protein fl) 


Construct ^ 


AJphaScrcen IC50 
NiNTA-ptire (jiM) 


Binding ELISAKiJ 


AiphaScreen IC5& 
Q-ptire (nM) 


Bijiding MASA Kd 
Q-pwre (nM) 


MOl 


C128 


120 


105 






M02 


cm 


196 


172 






M03 


C130 


129 


58 


328 




M04 


C131 


197 


251 






M03 


CI 32 


392 


69 






M06 


C133 


27 


40 


19 




Ma? 


Ci34 




121 






MOB 


CI 35 


80 


130 






M09 


C136 


SI 


185 






MIO 


C137 


256 


439 






MU 


C237 


4: 


MA 


3S6 




un 


C238 


79 


NA 






M13 


C239 


247 


NA 






MM 


C240 


293 








M15 


C241 


81 


NA 






M16 


C377 


15 


46 


27 




Ml? 


C378 


36 


79 


165 




un 


C379 


58 


U3 






Ml? 


C380 


35 


82 


Ul 




M20 


C38I 


22 


158 


186 




M21 


C382 


83 


116 






U22 


C3S3 


34 


66 


57 




. M23 


C384 


43 


138 






M24 


0385 


57 


77 






M25 


C386 


n 


74 


29 




M26 


C337 


13 


100 


16 


5,8 


M27 


C3S8 


42 


93 


105 




M2S 


C3S9 


30 


31 


152 




M29 


C390 


23 


43 


162 




M30 


C391 


34 


73 


161 




M31 


C392 


19 


99 


145 




M32 


C393 


20 


232 


69 




M33 


C394 


42 


157 


185 




M34{dimer> 


cm 


3 


556 


2 




M35 


cm 


2S 


93 


290 




M36 


C397 


52 


79 


213 




M37 


C39S 


28 


85 


232 




M38 


C399 


50 


95 


881 




M39 


C400 


35 


86 


85 




UAi) 


C401 


21 


240 


97 






C409 


2 


2.4 


1 


0.3 


RecM02 


C410 


2 


20.3 


2 


344.3 


RecM03 


cm 


2 


2,7 


3 


0.2 


RecM04 


C412 


24 


16.4 


94 


126,2 


RecMOS 


C413 


15 


9.7 


23 


2.0 


RecM06 


C4H 


1 


0.4 


2 


0.7 


RecM07 


C415 


i 


3.5 


13 


5.4 


RccMOS 


C416 


8 


393 


16 


24,6 


KccM09 


C417 


19 


422 


52 


5L6 


RecMlO 


C4I8 


1 


2.0 


2 


3,9 


RecMU 


C419 


1 


Li 


1 


03 


KecM12 


C420 


i 


3.8 


0,33 


03 


RficM13 


C421 


3 


2.5 


2 


0.2 


RecMi4 


C422 


11 


7,9 


15 


44,6 


Control ;■ 


rhHGF (R&D) 


0,152 









Example 7 

5 [411] This example describes experiments demonstrating inhibition of HGF~ 

induced cell proliferation by c-MET-binding monomers. 
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[412] HGF is a potent stimtdator of epithelial cell proliferation. The iise of 
A549 human Iraig adenocarcinoma cells in assays of HGF-induced proliferation for 
detenniJHiig efficacy of HGF and/or c-MET itdiibitors is well established in the art. For the 
puiposes of &ese experiments a single cell clone of the A549 cell line, termed A549-SC, was 
derived by limiting dilution. The A549-SC done was selected on the basis of its strong cell 
scattering response in the presence of HGF. 

[413] A549-SC cells were plated on collagen-coated 96 well plates (1x1 0'* 
cells/well) in 1 00 pi serum-free F42 medium per well, then incubated for 48 hr at SVC in 
5% CO2. After 48 hr, medium was removed from the wells and replaced with dilutions of 
monomer, in a volume of 50 iml per well serum-free F-12 medium. After 1 hr incubation at 
37°C in 5% CO2, 50 ^1 serum-free F-12 medium supplemented with 40 ng/ml recombinant 
human HGF was added, to give a final concentration of 20 ng/ml HGF, the EC50 for HGF. 
The plates were incubated for a fiirlher 48 hr at ZTC in 5% CO2, then pulsed with 2 fiCi 
iiitiated methylUiymidine per well for a furflier 1 5 hr. After pulsing, medium was removed 
and replaced with 200 ^il 0.05% trypsin per well, and Uie plates were incubated at 37"C for 5 
min. The labeled cells were then harvested to a glass fiber filter using a Tomtec Harvester 96. 
Incorporated label then measured by scintillation counting. 

[414] A recombinant fusion of the extracellular domain of human c-MET 
with immunoglobulin Fc domain (c-METFc) was used as aposMve control on these 
experiments (R&D Systems). A titration of c-METFc was mixed with recombinant human 
HGF to a final concentration of 20 ng/ml HGF and incubated at ZTC for 1 hour. This 
mixture of o-METFc and HGF was then added to seram-starved A549-SC cells in a 96 well 
plate, These cells were then preceded in the same fashion as those treated with the 
monomers ormultimers. 

[415] Figure 9 shows a comparison between c-METFc, a c-MET-specific 
monomer (M26) and a c-MET-specific dimer (KM12; RecM12) with regards to their relative 
abilities to block HGF-induced proliferation of serum-starved A549-SC human lung 
adenocarcinoma cells. 

[416] The IC50 for dimer RM12 is 0.32nM. The IC50 for c-METFc is 
1 .73nM. (n=3 for all data points.) Monomer M26 showed little detectable inhibitory activity 

in this cell-based assay. 

[417] This assay provides a means to screen monomers or multimers for anti- 
c~MET activity in an in vilxo bioassay using human cells. By detennining IC50 values for 
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tested multimers or monomers, optimal molecules can be identified and ranked on the basis 
of their biological activity. 

Example 8 . 
5 [418] Tte example describes experiments demonstratmg monomer bmdmg 

to a c-MET-expressing human cell line. 

[419] Monomers were constructed to include an influenza hemagluttinin 
(HA) epitope tag. This enables the monomer to be used ss a primary flow cytometry 
detection agent, with a fluorescent-tagged anti-HA secondary antibody being used as the 

1 0 secondary detection agent 

[420] 1 5 monomers selected by panning against c-MET were tested for tie 
ability to bind A549 human lung adenocarcinoma cells, a o-MET-expressing cell line. Jurkat 
T cells were used as a c-MET-negative control cell line. 

[4211 The adherent A549 cells were harvested ftom tissue culture plates 
1 5 using 10 mM EDTA in phosphate buffered saline (pH 7.4). Jurkat T cells were removed 
from culture medium by centrifagation. To determine monomer binding, 2.5 X 10* cells 
were stained with 10 \M c-MET monomer in 100 \i\ flow cytometry staining buffer ("FACS 
buffer:" PBS pH 7.4, 5% fetal calf serum, 0.01% sodium azide) on ice for 30 min. Cells were 
washed once with 4 ml ice-cold FACS buffer, then resuspended in 100 \i\ FACS buffer plus 
20 0.2 )ig FITC-conjugated anti-HA monoclonal antibody (Santa Cruz Biotechnology) and 
incubated on ice for 30 min. Cells were washed once with 4 mi ic&-cold FACS buffer, then 
resuspended in 200 ffl FACS buffer and analyzed using a FACSCalibur Flow Cytometer (BD 
Biosciences). Data were collected and analyzed using CellQuest Pro (BD Biosciences). The 
geometric mean fluorescence was determined for both A549 and Jurkat T cells, and 
25 normalized against the geometric mean fluorescence for that cell line stained with FITC- 
conjugated anti-HA monoclonal antibody alone. 

(4221 The following illustrates the prefetential binding of c-MET-specific 
monomerss to the c-MET positive A549 cells rather than to the c-MET negative Jurkat T 
cells. 
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Monomer # 


Geometric Mean FlTiorescence (Fluorescem Channel) vs. 
Secondary Antibody Alone 

A549 Juritat 


Seconflaiy AnuDoay moiic 


1 


1 


•t 
1 


1.19 


0.97 




1.69 


1.01 




1.60 


1.05 


A 
t 


1.77 


1.05 




1.72 


l.ll 


<; 
o 


1.72 


1.09 


7 


1.70 


1.11 


ft 
o 


1.46 


1.05 


y 


1.48 


1.05 


1 A 


1.56 


1.04 


11 


2.03 


1.07 


12 


2.42 


1.12 


13 


2.41 


1.10 


14 


2.80 


1.13 


15 


2.22 


1.12 



[423) These data show that anti-c-MET monomers bind to A549 huraan lung 
adenocarcinoma, a c-MBT positive cell line, but not to JtJrkat T cells, a c-MET negative cell 
line. This flow cytometry based method has utiHty in confirming specific monomer binding 
to target in the context of other cell surface proteins. In addition to demonstrating that 
monomers bind to native c-MET, this method also shows that monomers exhibit little or no 
non-specific binding to cells. 



Example 9 , . , 

I Q [424] This example describes experiments designed to show monomer 

inhibition of HGF-induced cell scattering. 

[425] HGF was identified as 'scatter factor', inducing a motile phenotype in 
epithelial cells. On addition of HGF, epithelial cell clusters break apart, and the cells migrate 

away fiom each other, or scatter. 
1 5 [426] A single cell clone of A549 human lung adenocarcinoma (termed 

A549-SC) was isolated by limiting dilution on the basis of forming tight clusters which 
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scatter over fhe coiase of 24 lir after addition of recombinant human HGF, This clone was 
used in all subsequent experiments. 

[427] A549-SC was plated at 25 cells/well in a 96 well plate in F-1 2 medium 
supplemented with 10% FBS. CeUs were cultured until clusters of 20 to 30 ceUs were 
5 visible, approximately 4 days. 

[428) After 4 days, medium was removed IBrom the cells, and replaced with 
monomer dilutions in a volume of 50 jxl/well of serum-free F-12 medium. In addition, a 
recombinant fusion protein of the extracellular domain of human c-MET with 
immunoglobulin Fc domdn (c-METFc) was used as a positive control on these experiments 
10 (R&D Systems). After 1 hr incubation at iTC in 5% CO2, 50 ^1 per well 40 ng/ral 

recombinant HGF in serum free F-12 medium was added, to give a final concentration of 20 
ng/ral HGF. Control wells lacMag HGF were also included. iTie plates were Uien incubated 
for 24 hr at 37°C in 5% COz. After 24 hr medium was removed from the plates, and fhe cells 
were fixed with 100% methanol for 15 min at room temperature, and then stained for 1 hr at 
15 TOom temperature with 0.2% crystal violet in 30% ethanol. Stained ceUs were washed with 
phosphate-buffered saline, and then photographed. 

[429] Twenty ng/ml (^proximately EC50) of the o-MET ligand induces a 
cell scattering response in A549-SC cells; a monomer of irrelevant specificity (negative 
control) as expected did not inhibit Ms scattering response. In contrast, both 0.5 c-MET- 
20 Fc (positive confrol) and 1 ^iM of an anti-o-MET monomer Avimer partially reversed the 
HGF-induced scattering response. These data illustrate that a anti-c^MET monomer can 
inhibit fhe scattering response to at least a similai- extent as a compm-able concentration of a 
positive control inhibitor c-MET-Fc. 

25 c-MET blading monomers & dimers 

[430] The following provides a summary of the c-MET monomers identified, 
grouped by sequence homology. There are 10 families, wherein members of the same family 

have related sequences, 

[431] The information can be summarized as follows. Sequences in brackets 

30 {"[]") inidicate alternate amino acids at a single positioa 
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Motif for all 10 families i. 

Family consensus ^equence^ (periods indicate any amino acid; spacing i 

merely for alignment purposes. One row includes one contiguous polypeptide) 

w . cdgdndced , sde , 
cdg . ndced. sde . 
w , cdg * . deed . sde . . 
w . cdg . ndcedssde . , c 

dc * d . sde , 

w . c * g . ndced. sde , 
w.c. . * .dc.d.sd, . , - 
w * cdg , ndc , d . s . e , . . , c 
w*Cvg^ ,dc.d, sde. 
w^cdg . . dc, d. sde. 



Fam 


1 




[eq]f .c. 


St .r 


c[iv] ... 


Fam 


2 


c . * . * 


[eq] fee. 


st , r 


c[iv] 


Fam 


3 


C, t . . 


[eqlf .c. 


st , r 


c[ilv]p. . 


Fam 


4 




teqjfqc. 


st .r 


c [iv] p. K 


Fam 


5 


c. . * 


[eq]f .c. 




c[ilvl . , , 


Fam 


6 


c * * . 


[eq]f .c. 


stgr 


c. p. . 


FaJH 


7 


c » . . . 


[eq]f .c. 


st ,r 


c[ilvl ... 


Fam 


8 


C, . 4 


[eq] f .c. 




c[ilv] . . , 






c . * . . 


[eq]f .c. 


st.r 


c[llv]p. . 


Fam 


10 


c, * » . 


[eqjf.c. 




ctilv] . , . 



Natural A-^domaxns; 

c{ ,) . . . . f vC, c[ilv] 



, . cd . . . dc , d , sde . ( . 



Al library: 

c..( J 

a 

P 
s 



q 

k 



f .c. 



1 
1 

V 



, , cdg . . dc . d. sde c 

p 11 
s wr en 

V vd 



[432] Based on family 1 0 alignments, the invention provides polypeptides 
comprising non-naturally occurring monomer domains that bind c«MET and that have the 
sequence GR or KR immediately preceding the third cysteine in an A domain scaffold. 

14331 Details of each c-MET^hinding family follows. Dashes ("-") are 
inserted for alignment purposes and do not represent positions in the proteins 

CAPSEFTCNSTGRCIFQEWVCDGDl^DCEDSSDEAPDLCASAAPT 

CAPSEFTCNSTGRCIPQEWVCDGDNDCEDSSDEAPDLCASAAPT 

CAFSEFTOTSTGRCIPQBWVCDGDNDCEDSSDEAPDLCASAAPT 

CAPSEFTCTSTGRCIFQEWVCDGDNDCEDSSDEAPDLCASAAPT 

CAPSEFTCNSTGKCXPQEWVCDGDTslDCEDSSOEAPDLCASAAPT 

CAPSEFTOTSTGRCIPQEWVCDGDWDCEDSSPEAPDLCASAAPT 

CAPSQFTCI^ISTGRCIPQEWVCDGDNDCEDSSDEAFDLCAIAAFT 

CLa!:?EFTCRSTGRCIPQTWVCDGDI^IDCEDSSDEAPDLCA$AAPT 

CESi^EFQCRSTl^RCIPLQWVCDGDWDCEOSSDEAPDLCASAAPT 

CESNEFQCSSTGRCXPQMVCDGDl^DCEDSSDEAPDLCASAAPT 

CRA!:^EFQCHSTGRCIPASWLCDGDNDCEDSSDEAPDLCASAAPT 

CSPNEFQCRSTGRCISLAWVCDGDNDCEDSSDEAPALCK?\SVPT 

CPASEFTCRSTGRCISQGP^VCDGDNDCEDSSDESPAICATTGPT 

CPAGQFTCESTi^RCIPLQWVCDGDlSjDCEDSSDESPAICATTGPT 

CPASgFTCRSTDRClPXAWVCDGDHDCEDSSDESPEICSAPASEPPG 

CQASQFTCESTGRCIPLDWVCDGDDDCEDGSDESPEICAAPAPT 

CESNEFQCRSTGRCVPLSWVCDGDHDCEDGSDESPAICKTPGHT 

CESNEFQCHSTGRCIPQAWI.CDGDt<lDCEDSSDEAPAICKTPGHt 

CRST^EFTCRSTERCIPLGWCDGDNDCEDSSBEAPXIRKTPGHT 

CPMEFKCHSTGRCISLAOTCDGDKDCEDSSDEKS--CRGPGHT 



Fam2 



CQSFTEFECHSTGRCIPLQWVCDGDNDCEDSSDESP- 



-ATCATPGHT 
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CQSFTEFECHSTGRCIPASWLCDGDNDCEDSSDESP ^-ANCATPAHT 

CQSFTEFECHSTGKCIPVEWLCDGDI>]DCEDSSDSAP AXCKTPGHT 

CQSFTEFECHSTGRCIPASWLCDGDMDCEDSSDE EGCSAAAFT 

CQSFTEFECHSTGRCIPVD^^LCDGDNDCEDSSDE KDCKQ—HT 

CQSFTEFECHSTGKCIPVDWLCDGDNDCEDSSDE KDCKQ— ET 

CQSFTEFECHSTGRCIPRTWX.CDGDlSiDCEDSSDE KDCKQ~HT 

CQSFTEFECHSTGRCIPVDWLCDGDNDCEDGSDE KSCFA— HT 

CQSFTEFECBSTGRCIPVDWLCDGDNDCEDSSDE KNCQP—PT 

CHPTAEFECHSTGRCIPVDl>?LCDGDi;iDCEDSSDE Kl^lCKA— HT 

CHPIAEFECHSTSRCIPETWLCDGDlNfDCEDSSDS ANCQP— PT 

CHPTAEFECN8TGRCVSADWLCDGDNDCEDGSDESP ALCK—APT 

CHPTAEFECTSTGRCVSADWLCDGDI^IDCEIDGSDESP ALCK—APT 

CHPTASFECTSTGRCVSADWLCDGDNDCEDGSDESS APCETTGPT 

CHPTSEFECRSTARCIPLTWVCDGDNDCEDSSDEK HCOPP--T 

CHPTSEFECRSTAKCXPLTOTCDGDI^DCEDSSDEAP AICKTFGHT 

CHAPTQFECRSTORCXPLQWVCDGDlslDCEDSSDE TGCAK—PT 

CHTPTQFECRSTGRClPLEWLCDGDt^DCEDSSDE TGCAK—PT 

CHAPTQFECRSTGRCXPLQWVCDGDl^DCEDSSDES LATCQQ— HT 

CmPNQFECRSTSRCIPLGWCDGVHDCEDSSDE TDCQE--FT 

CHAPTQFECRSTGRCIPRDWVCDGDI^DCEDSSDEA SCGAPG— PT 

CQASDQFECKSTGRCIPLAWRCDGDNDGEDGSDESPAICGRPGLEASG— GS 

CQA3DQFECKSTGRCIPI.AWRCDGVNDCEDGSDE AGCAASG— PT 

CQASDQFECKSTGRCXPLDWLCDGVNDCEDSSDE ALEI^CA-QHT 



Fam3 

CG-SSEFQCHSTGRCXPENWVCDGDDDCEDSSDEK— SCTSAAPT 

CG-SSBF0CHSTGRCIPENWVCDGDDDCEDSSDEK--3CTSAAPa: 

CG-SSEFQCHSTGRCIPE1SIWVCDGDDDCDDSSDEK--SCTSAAPT 

CE-S]^EFQCQSTGRCIPRTWVCDGD]^DCEDSSDEK--SCTTPAPT 

CE-ST>fEFQCRS'rGRCVPVAl^VCDGDlS!DCEDSSDET--GCKAPT 

CE-SNEFQCRSTGRCVPVAWVCDGDl^DCEDSSDET--GCAKPT 

CE-SNEFQCRSTGRCVPVAWVCDGDJ>iDCEDSSDEK— 1^ICKAHT 

CE-SMEFQCRSO'GRCVPVAWVCDGDNDCEDSSDEK— NCKAPT- 

CE-3MEFQCRSTGRCVPVAWVCDGDNDCEDSSDEK~-DCSAPASEPPGSL 

CE-SNEFQCRSTGRCVPVAWVCDGDNDCSDSSDEA— l^CGDSHILPFSTPGPST- 
CE-SlslEFQCRSTGRCVPVAT^VCDGDNDCEDSSDEK—DCGDSHILPFSTPGPST- 
CE-SNSFQCRSTGRCIPVSWVCDGDNDCEDSSDEA— SCGDSHILPFGTPGPST- 

CE~SNEFQCRSTGKCVPVAWVCDGDiSlDCEDSSDEA--SCG ■ APGPT— 

CE-ASEFTCRSraRCIPVDWVCDGDNDCEDSSDEK— GCGDSHILPFSTPGPST" 
CE-ASEFTCRSTNRCXPVDWVCDGDNDCEDSSDEK— GCGDSHILPFSTPGPST- 
CE-ASBFTCRSTNRCIPQDWVCDGDNDCEDSSDEK— GCGDSKILPFSTFGPST- 
CE-ASEFTCRSTHRCIPLQWCDGDWDCEDSSDEA— NCGDSHILPFSTFGPST- 
CP-AGQFTCRSTNRCIPLQWVCDGDNDCEDSSDEA—NCGDSHILPFSTPGPST- 
CE-ASEFTCRSTMRCIPAI^^WVCDGDNDCEDSSDEA— IsICGDSHXLPFSTPGPSX- 
CE-PSQFTCRSTSRCIPRWLCDGDlSiDCEDSSDEAPALCGDSHlLPFSTLGPST" 
CL-SSEFTCKSraRCIPRAWVCDGDi^DCEDSSDEAPALCGDSHXX>PFSTPGPST- 
CX"XSQFXCKSTGRCIPAEWVCDGDr^DCEDSSDEA— SRGDSHXLPFSTPGPST- 
CQ-ADQFQCRSTSRCXPAPWVCDGVJ:3DCEDGSDET— SCGDSHXLPFSTPGPST- 
CR-ADQFQCRSTMRCLPGPWVCDGVNDCEDGSDST— GCGDSHXLPFSTPGPST- 
CQ-TGEFRCRSTDRCIPAEWVCDGDSDCEDGSDET—NCGDSHXLPFSTPGPST"- 
CA-SNEFRCRSTGRCIPQRWVCDGDJ^DCEDGSDET^-^^CGDSHXLPFl^TPGPXT- 

CQS FTEFECHSTGRC I PVDWLCDGDNDCED3 S DEK—GCGDSHXLPFSTPGP ST- 
CQSFTEFECHSTGRCIPAEWCDGDD^DCEDSSDEK— GCGDSHXLPFSTFGPST" 
CHPTAEFECHSTGRCIPVDWLCDGD1^DCEDSSDEK--GCGDSHXX.PFSTPGPST- 
CHAPTQFECRSTGRCIPLTWVCDGDWDCEDGSDEK— DCGDSBILPFSTPGPST- 
CX-PSEFTCKSTGRCIPLDWCDGDt^DCEDSSDEK— GCGDPHXLPFSTPGPST- 
CA-ADEFQCNSTGRCIPVSWVCDGDNDCEDSSDEK—GCGDPHXLPFSTPGPST™ 

CL-ANEFTCRSTGRCXPLQWCDGDNDCEDSSDEK— GCGDSHILP GLEAS 

CE-ASEFTCRST!^RCXPLQWCDGDI^DCEDSSDEK— GCGDSHXLPFSTPGLEAS 
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CA-SSErRCRSTGRCIPQRWVCDGDNDCEDSSDSK~~GCGDSHXLPFSTPGLEAS 
CA-SSEFRCRSTGKCIPQRWVCDGDt^lDCEDGSDET— NCGDSHILPH'STFGLEAS 
CR-SNEFTCKSTGRCIPRTWCDGDNDCEDGSDESPAICGDSHILPFSTPGLEAS 

CE-SNEFQCOSTSRCXPLTWRCDGDNDCEDSSDEE— -SCKPPT 

CE-3lNiEFQCQSTSRCXFX.TWRCDGDNI>CEDS3DEK— SCSAPASEPPGLEAS 

CE-SNSFQCQSTSRCIPLDWVCDGDISIDCEDSSDEA— SCGAPGFT 

CE™SNEFQCQSTSRCIPLTWKCDGDNDCEDSSDEA-~SCGAPGPT 

CG-SDEFQCKSTSRC1PLTWRCDGD3DCEDSSDEA— NCGHPGLEASGGS 

CQ™SNEFTCQSTWRCLPLPWVCDGDNDCEDSSDEA-- -NCGQRT 

CA"ADEFQCNSTGRCIPAAWVCDGDroCEDSSDEA~SCGAPGPT 

CE-ADEFTCRSTNRCIFLQWVCDGDNDCEDSSDEA--SCGAPGFT 

CA-ADEFQCRSTl<iRCIPX.QWVCDGDl^DCEDSSDEA— NC-TP-PT — 



CAPt^EFQCSSTSRCIPQRWLCDGDNDCEDSSDEANCAK—HT 
CAPNEFQCSSTSRCIPQRP5X.CDGDHDCEDSSDEAJ?^CAK--HT 
CVSSBFQCHSTGRCIPRTWLCDGD"l^DCEDSSDEANCAK--HT 
CAP3EFQCQSTKRCIPEGWLCDGVNDCEDSSDEKGCEAPVRT 
CAPSEFQCQSTKRCIPEGV3LCDGWDCEDSSPEKGCEAPVKT 
CVPSEFQCQSTGRCIPRTWLCDGDNDCEDSSDEADCKAPGHT 
CPSDQFQCSSTWRCIPRSmCDGVl^DCEDSSDEADCAA— PT 
CPSDQFQCSSTNKCIPRSWLCDGVNDCEDSSDEADCAA— FT 
CESMQFQCHSTGRCVPQSWRCDGVNDCEDSSDETDCAP--PT 
CLSI^QFQCHSTNRCIPQHWLCDGDISIDCEDSSDEKGCAATGHT 
CGADQFQCQSTKRCVPQRWVCDGDNDCEDSSDEKDCKATART 



CPX-KEFKCG-l<iGRCLPI.RLRCXXENDCGDXSDE-"E Ij^CSAPASEPPG 

CPA-GEFQCK-I^GRCLPPAV?LCDGDl^DCGDNSDE--T GCSAPASEPPG 

CQA~DQFPCS-NGHCVPQTLVXDGVPDCQDDSDE--'r NCSAPASEPPG 

CLA-DEFPCKSTGRCIPAAWLCDGDNDCEDQSDE--T KCSAPASEPPG 

CAA-DEFQCQSTGRCIPVRWLCDGDWDCEDGSDE^-T SCSAPASEPPG 

CIiA-NQFQCRSTGRCXSRDWVCDGVNDCEDGSDE"-T SCSAPASEPPG 

CAA-DQFQCRSTGRCIPRTWLCDGWDCEDGSDEPLA LCSAPASEPPG 

CNT-TQFSCRSraKCXPLDWQCDGVTDCBDWSDE""A SCSAPASEPPG 

CLP-SEFQCKSTl^RCIPQAWLYDGVNDCEDGSDE— T NCSAPASEPPG 

CLF-SEFQCKSTNRCIPQAWLYDGVI^DCEDGSDE— T SCSAPASEPPG 

CLP-SQFQCigSTNRCIPLAWLYDGVNDCEDSSDS—X SCSAPASEPPG 

CXP-SQFTCHSTDKCIPLEWLCDGDNDCEDNSDE— T GCSAPASEPPG 

CEP-t^QFTCHSTSRCIPQPWKCDGVNDCEDGSDEALA TCSAPASEFPG 

CEP-NQFTCHSTSECIPQPWRCDGVI^DCEDGSDEALA TCSAPASEFPG 

CES-NEFQCQSTSRCIPASWLCDGVWDCEDGSDE— T GCSAPASEPPG 

CGS-DEFQCKSTKRCXPI.^^1WX^CDGVNDCEDSSDEPPA TCSAPASEPPG 

CGS-DEFOCKSTRRCXPOTLCDGVl^DCEDSSDEPPA TCSAPASEFPG 

CGS-DSFQCKSTRRCIPLI^WLCDGVPDCEDSSDEPPA TCSAPASEPPG 

CPS-DEFQCNSTGRCXSLTWLCDGVl^lDCEDGSDE— K ^SCSAPASEPPG 

CFS-DEFQCNSTGRClSLTWLCDGVt^DCEDGSDE— K SCSAPASEPPG 

CPS-DEFQCJNISTGRCISLTWLCPGVNDCEDRSDE— K SCSAPASEPPG 

CQSFTEFECHSTGRCXPVDWX.CDGDNDCEDSSDESPA ICSAPASEPPG 

CQSFTEFECHSTGRCIPVDWLCDGDNDCEDSSDESPA ICSAPASEPPG 

CQSFTEFECHSTGRCIPVDWLCDGDWDCEDS3DESSA HCSAPASEPPG 

CBFTAEFECHSTGRClPVAWLCDGDlviDCEDSSDESSA HCSAPASEPPG 

CDPIAEFKCHSTGRCXPLDWLCDGVNDCEDSSDESPA HCSAPASEPPG 

CPS-DEFKCBSTGRCLPVEV?LCDGVNDCEDGSDE-~A SCSAPASEPPG 

CPP-HEFQCHSTGRCISHDTOCDGVWDCEDSSDEAPD LCGAPASEPPG 

CVA-SEFTCRSTGRCXPESWRCDGDt^DCEDSSDESPD LCSAPASEPPG 

CGA»SEFQCRSTGRCLPQHV^RCDGDNDCEDSSDEED CSAPASESPG 

CQA-3EFTCHSTGRCLPRAV3LCDGDNDCEDGSDEED CSAPASEPFG 
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CES-SEFRCR$TGRCIPGGV?LCDGDWDCH;DSSDETD CSAPASEPPG 

CAA"DEFQClSi3TGRCIPVSWVCDGDNDCEDSSDESPD -LCSAPASEPPG 

CAA-DEFQCMSTGRCIPVSWVCDGDNDCEDSSDESSA HCSAPASEPPG 

C AA- DEFQCNS TGRC I PVS WCDGDNDCE DS S DEKD C SAPASEP PG 

5 CSS-DEFQCSSTSRCIPREOTCDGDNDCEDSSDEKD ^ CGAPASEPPG 

CLA"-^EFTCRSTERCIPLGWXXGDT^IDCEDSSDE-"E WCSASASEPPC 

CLX-KEFTCRSTI^RCIPLQWVCXGXNDCEDSSDE— K Ij^CSAFASEPPG 

CLA-NEFTCRSTNRC1PLQWVCDGDHDCED3SDE— A GCSAPASEPPG 

CI^S-MEFTCRSTKRCLPRQWVCDGDNDCEDSSDE— E DCSAPASEPPG 

10 CGS-NQFTCRSTKRCXTATWCDGDNDCEDSSDE— T DCSAPASEPPG 

CQA-l^EFTCRSTSRCIPATWVCDGDNDCEDSSDE— K DCSAPASEPPG 

CES-NEFQCQSTGRCISRDWVCDGD^DCEDSSDE— E DCSAPASEPPG 

CES-I^EFQCQSTSKCIPLTWRCDGDNDCEDSSDEAPE HCSAPASEPPG 

CES-^EFQCQSTSRCIPREWCDGDWDCEDSSDESSA HCSAPASEPPG 

15 CBS-NEFQCHSTGRCIPASWLCDGDMDCEDSSDESSA HCSAPASEPPG 

CRS-NEFTCRSTERCIPLGWVCDGDNDCEDSSDESSA HCSAPASEPPG 

CXSFTEFECRSTGXCIPMWVCDGDMDCEDSSDE— E XCSAPASEPPG 

CEA-SEFTCRSTHRCIPLDWVCDGDNDCEDSSDEKS CSAPASEPPG 

CEA-SEFTCRSTNKCIPVDWVCDGVHDCEDSSDESSD ICSAPASEPPG 

20 CVP-SEFQCRSTNRCIPLDWVCDGD^SIDCEDSSDEKS CSAPASEPPG 

CVS-GEFTCRSTi^RCIPVDWVCDQDNDCEDSSDEKD CGAPASEPPG 

CEP-SQFTCKSTHRCIPQEWVCDGDNDCEDGSDB — K SCSAPAPSPPG 

CEP-SQFPCHSTNRCLPLAWVCDGDNDCEDSSDE--K NCSAPASEPSG 

CES - S QFTCNSTKRCI PLAWVC DGDDDCE DGS DE - SCE AP AHT 

25 CQP™SQFTCHSTDRClPLEWLCDGDNDCEDSSDE-'-K KCKAHT 

CQP-SQFTCHSTDRCXPLEWLCDGDNDCEDSSDE— K NCKAHT 

CLP-SQFTCHSTDRCIPLEWLCDGDKDCEDSSDE--K NCKAHT 

CQP-DQFTCHSTDRCIPLEWLCDGDNDCEDSSDE—K t;!CKAHT 

CPP-HQFTCHSTDRCIPLEWLCDGDNDCEDSSDE— K HCKMIT 

30 CQP-SQFTCXRTDRC1PLEWLCDGDNDCEDSSDE--K IslCKAXT 

Fam6 

CAADEFQClsrSTGRCIPVSOTCDGWiDCEDSSDEAGC-ATSGPT— 

35 CAADEFQC1^!STGRCIPVSWVCDGTODCEDSSDEAGC-ATSGPT— 
CAADEFQCJsrSTGRCIPVSWVCDGVNDCEDSSDEAGC-ATSGP^ — 
CAADQFQCHSTGRCIPVSWVCDGVT^^DCEDSSDEAGC-A'TSGPT-"- 
CAADEFQCNSTGRCIPVSWVCDGVTSiDCEDSSDETDC-APH — T — 
CAADEFQCNSTGRCIPVSWVCDGDKDCEDGSDESPA-LCKAPT — 

40 CAADEFQCNSTGRC I PQEWVCDGVNDCE DSS DES PA'-LCKAPT-- 

CAADEFQCNSTGRCIPVSWVCDGDr^DCEDSSDEES CETPT-™- 

C AASE FQC RS TGRCI PVEWXC DG DNDCE DS S DETG CKXPT — 

CESDEFQCHSTGRCIPLDWVCDGDtSlDCEDSSDE-"K"DCKQHT-- 
CS3DEFQCHSTGRCIPLDWVCDGDNDCEDSSDE— K-DCKQHT^-- 

45 CSSNEFOCHSTGRCIPLQOTCDGDi^DCEDSSDE--K-'DCKXXT-™ 
CVS~'b?EFQCHSTGRCIPKEWRCDGDNDCEDSSDE--K-DCKQHT-- 
CESNEFQCHSTGRCIPASWLCDGDNDCEDSSDE--K-DCKQHT--™ 
CRAKEFQCHSTGRCIPASWLCDGDNDCEDSSDE--^K-DCKQHT™-' 
CLAtJ^EFTCKSTGRCXPASWLCDGDHDCSDSSDE—K-DCKQHT-- 

50 CEASEFQClNlSTGRCIPRQWLCDGDNDCEDSSDE--K-'DCKQHT-™ 
CAASEFQCHSTGRCI PASWLCDGDKDCEDSS DESLA-TCQQHT- - 
CPPDEFRCNSTGRCIPRAWLCDGDNDCEDSSDE— K-DCKQKT— 
CPPDEFRCD^STGRCIPRAWLCDGDNDCEDSSDE— K-DCKQHT--^ 
CPPDEFRCNSTGRCIPRAWLCHGDNDCEDSSDE-- K-DCKQHT-- 

55 CPPDEFRCNSTGRCIPRAWLCDGDISIDCEDSSDE— K-DCKKHT"-^ 

CPPDEFRCMSTGRCIPRAWLCHGDNDCEDSSDE— K-DCKPHT-- 
CLANEFRCHSTGRCIPRAWLCDGDI^DCEDSSDE— K-DCKQHT-- 
CQTGEFRCRSTGRCIPRAWCDGDWDCEDSSDE— K-DCKQHT-- 
CRADEFQCRSfGRCIPGAWRCDGDl^DCEDSSDE— K-DCKQKT-- 

60 CAADEFQCNSTGECIPVSWVCDGDt^DCEDS$DE-"K--DCKQHT— 
CAADEFQCNSTGRCIPLQl^VCDGDt^DCEDSSDE— K-DCKQHT"- 
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CMDEFQCN S TGECIPVS WCDGDl^DCSDS S WCKAHT— 
CAADEFQCNSTGRCXPAEOTCDGDNDCEDSSDE-"-K'-t^CKAHT— 
CAPSEFTCNSTGRCIPQEWVCPGDi>lDCEDSSDE-™K'"DCKQHT— 
CQPNEFQCHSTGRCIPASWLCDGDNDCEDSSDESPA-NCATPTHT 
CVPi^EFQCNSTGRCIPQAWVCDGVNDCEDSSDESSA--LCSEPT-" 

CEPDEFQCRSTGRCI PLEWLCDG DKDCEDS S DET GCAKPT-- 

CPPDEFRCNSTGRCIPLATOCDGDHDCEDSSDET-----NCQPPT'-- 
CAAGBFQOTSTGRClPAAWLCDGD-t^DCEDSSDEEGC-GAAEPT-- 
CQLDQFRCRSTGRCIPQAWLCDGDNDCEDSSDESGC-GAAEPT— 



Fain7 

CP'-ADQFTCRSTDRCIPGDWVCDAVNDCEDGSDEK--NCLER T 

CP-"ADQFTCRSTDRCIPGDWVCDAV^DCEDGSDEK--NCLER T--~ 

15 CG-SDQFQCRSTDKCIPRTWVCDGDNDCEDGSDEK--DCTRS VPT~ 

QQ_ SGQFQCX S TGRCX PRTW VC DGD^^)DCEDS SDEK—NCQPP f 

CQ-SGQFQCXSTGRC1PRTWVCDGDNDCEDSSDEK~NCQPP T- — 

CA"SDQFQCRSTGRCIPQHWLCDGD1SlDCEDGSDEK--l^JCGPPGPSAISTAAG 

. CR- ANEFQCHSTGRCI PASWLCDGDNDCEDG S DE-SQLCTA HT — 

20 CK-AWEFQCHSTGRCIPASWLCDGDNDCEDGSDE'-SQLCTA HT 

CR-ANEFQCHS TGRC 1 P ASWLCDGDNDCEDGS DE-- SQLCTA HT— ~ 

CR™ ANEFQCHS TGRCI PASWLC DGDN DCEDGS DE- S QLCT A HT— 

CR-AHEFQCHSTGRCIPASWLCDGDNDCEDGSDE-SQLCTA HT-~- 

CR-ANEFQCHSTGRCXPASWLCDGD]^DCEDGSDE--SQLCTA HT 

25 CR-At^ E FQCHSTGECI PAS WLCDGDN DCE DG S DE- SQLCTA HT 

CR-AJ^EFQCHSTGRCXPASWLCDGDKIDCEDGSDE-SQLCTA HT— 

CR-ATSlEFQCHSTGECIPASVifLCDGDHDCEDGSDE-SQLCTA HT— 

CR-At^EFQCHSTGKClPASWLCDGDt^DCEDGSDE-SQLCTA HT 

CR-A^EFQCHSTGRCLPASWLCDGDNDCEDGSDE-SQLCTA HT 

30 CL"-Ab:iQFPCHSTGRCIPASWVCDGD!:3DCEDGSDE~SHLCTA HT— 

CR-AWEFPCHSTGRCIPASWLCDGDNDCEDGSDE-SHLCTA HT 

CL-Sl^^EFPCRSTGRCIPASWLCDGDNDCEDGSDE-SQLCTA HT— 

CE-SGEFQCHSTGRCIPASWLCDGDNDCEDGSDE-SQLCTA HT 

CE~SGSFQCHSTGRCIPASWLCDGDt3DCEDGSDE-SQLCTA HT 

35 CE~SG£FQCHSTCRCIPASW1CDGDNDCEDGSDE-SQLCTA HT— 

CEPSGQFECHSTGRCIPASWLCDGDt^DCEDGSDE-SQLCTA HT— 

Cr^-ADEFQCHSTGRCIPQAWRCDGDNDCEDGSDE-SQLCTA HT— 

CE-ASEFTCRSTDRCIPVHWVCDGVNDCEDG3DE-AQVCTE HT 

CL-ADEFRCSSTNRCIFLDWVCDGVNDCEDGSDE-AQVCTE' HT— 

40 cp-agqftcrsti^rcxpx.qwcdg™dcedssde-^sqhcpp HT— 

CP-AGQFTCRSTNRCIPLQWCHGVNDCEDSSDE-SQHCPP HT 

CA-SDEFTCHSTRRCIPQTWVCDGD^iDCEDGSDE-TD-CPP HT 

CP-SSQFTCHSTRRCIPQTWVCDGDNDCEDGSDE-TD-CPP HT— ■ 

CV- SHE FTCRS TKRCVPQEW VC DG Vl^ DC EDGSDE-TG-^CPK HT 

45 CV-SNEFTCRSTKRCVPQEWCDGVNDCEDGSDE-TG-CPK HT 

CV-S^iEFTCRSTKRCVPQEWVCDGVMDCEDGSDE-TG-CPK HT 

CV-SWEFTCRSTKRCVPQEOTCDGVNDCEDGSDE-TG-CPK BT— 

CO-Al^QFKCRSTSRCXPLAWVCDGDl^DCEDGSDE-EG-CKP HT— 

CA-SGQFQCRSTGRCLPLPWVCDGDNDCEDGSDSAPAICEK H— T- 

50 CA-SSEFQCKSTERCLPLEWVCDGVISIDCEDGSDEAPAICTT PGPT- 

CP-PSQFQCRSTGRCIPLHWRCDGTODCEDGSDEPPEPCTA TVPT- 

CQ-Pl>3QFQCKSTGRCLPLDWVCDGVIvlDCEDGSDESSAPCET TGPT- 

CE-SSQFQCRSTGRCLPPDWVCDGVI^DCEDGSDEAG— CQP HR 

CE-ASEFQCRSTKRCLPRHWVCDGDNDCEDGSDEKS—CPA HT 

55 CE-ASEFQCRSTKRCLPRROTCDGDNDCEDGSDEKS--CPA HT— ~ 

CE-ASEFQCRSTKRCLPRHWVCDGDNDCXDGSDEKS™-CPI. H 

CR-- S GQFQCRST WRC I SRT WC DGDN DCE DGS DE A3AI CES SEHT- 

CP-PDEFRCTSTl^RCISRTWCDGDNDCEDGSDEASAICES SEHT- 

CE-SNEFQCQSTSRCIPLTWRCDGVl^DCEDGSDE—ANCTA AVHT- 

60 CE-SHEFQCQSTSRCIPLTWRCDGVNDCEDGSDE— Al^CTA AVHT- 

CE-SNEFQCQSTSRCIPLTWRCDGVMDCEDGSDE—AiNCTA AVHT- 
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CE " SNE FQCQS T S RC 1 PLTWKCDG VX^ DCEDG3DE- -ANCTA AVHT- 

CE-SNEFQCQSTSRClFLTWRCDGV^fDCEDGSDE™-ANCTA AVHT- 

CE--SNEFQCQSTSRCXPL™RCDG\rNDCEDGSDE--ANCTA AVHT~ 

CE-St^EFQCQSTSRCIPLTWRCDGVlQDCBDGSDE^-ANCTA AVHT- 

5 CE-SNBFQCQSTSECIPLTWRCDGVISIDCEDGSDE— ANCTA AVHT- 

CE-Sl^EFQCXSTSRCI PLTWECDG WDGE DGS DE- -ANCTA AVHT - 

CE-SI^1EFQCHSTSRCIPL™RCDRV1J3DCEDGSDE--ANCTA AVHT- 

CE^S13EFQCQSTSRCIPLTWRCDGVXDCEDGSDE"--AXCTA AVHT- 

CE-P SQFTCRSTSRCI PRTWLCDG DNDCEDGS DE- - ANCTA AVHT- 

10 CS-SDEFQCSSTSRCIPREWVCDGVNDCEDGSDEALAPCTS TAHT- 

CS-SXEFQCSSTSRCIPEEOTCDGVNDCEDGSDXALAXC^S TAHT- 

CV-SGEFQCRSTGRCIPRDWLCDGVNDCBDGSDEPSAPCTT AAHT- 



15 FamS 

CPSGEFQCRSraRCXPETWLCDGDNDCEDGSDEE— SCTP-PT— ~ 

CPSGEFQCRSTORC1PETWLCDGDNDCEDGSDEE--SCTP-PT 

CPSGEFQCRST^RCIPEXWLCDGDNDCEDGSDEE--SCTP~PT 

CPSGEF0CRSTNRCIFET1/^LCDGD1^DCEDGSDEE--SCTP"PT 

20 CPSGEFQCRSTNRCIPETWLCDGDWDCEDGSDEE^-SCTP-PT 

CPSGEFQCRST]S}RCIPHl"?WLCDGDIslDCEDGSDEE— SCTP-PT 

CPSGEFQCRSTt^RCIPETWLCDGDNDCEDGSDEE— SCTP-PT 

CPSGEFQCRSTNRCIPETWLCDGDI^DCEDGSDEE— SCTP-PT— • 

CPSGEFQCRSTNRCIPETWLCDGDWDCEDGSDEE—SCTP-PT^ 

25 CPSGEFQCRS™RCXPETWLCDGD]:^DCEDGSDEE— SCTP-PT 

CPSGEFQCRSTNRCIPETWLCDGDNDCEDGSDBE—SCTP-PT- 

CPSGEFQCRSTNRClPETWLCDGDb^DCEDGSDEE— SCTP-FT 

CPSGEFQCRSTNRCIPETWLCDGDWDCEDGSDEE— SCTP-PT 

CPSGEFQCRSraECIPETWLCDGDiqDCEDGSDEE— SCTP-PT 

30 CPSGEFQCRSTNRCXPETWLCDGDNDCEDGSDEE~-SCTP"PT 

CPSGEFQCRSTISIRCIPETOLCDGDHDCEDGSDEE— SCTP-PT 

CPSGEFQCRSraRCXPETWLCDGDNDCEDGSDEE— SCTP-PT 

CPSGEFQCRSTmCIPETWLCDGDlJ^DCEDGSDEE— SCTP~PT 

CPSGEFQCRSTlSfRCIPETWLCDGDNDCEDGSDEE-'-SCTP-PT 

35 CPSGEFQCRSTNRCIPETOLCDGDMDCEDGSDEE— SCTP-PT 

CPSGEFQCRSraRCXPETWLCDGDT^IDCEDGSDEE~SCTP-PT 

CPSGEFQCRSTNKCXPETWLCDGDJJDCEDGSDEE— SCTP-PT 

CPSGEFQCRS™RCXPETP^LCDGDKDCEDGSDEE— SCTP-PT 

CPSGEFQCRSraRCXPSTV^LCDGDNDCEDGSDEE~SCTP-PT 

40 CPSGEFQCRXTNRCIPETWLCDGDNDCEDGSDEE--$CTP"-PT 

CPSGEFQCRSTNRC1PETWLCDGDI^DCEDGSDEX--SCTP-PT 

cppgef6ces™rcipetwlcdgdndcedgse)EE"-sctp-pt 

CPSGEFQCRSraRCIPXTWLCDGDNDCEDGSDEE--3CTP-PT 

CPSGEFQCRST1:^RCIPKTWLCDGD^DCEDGSDEE--SCTP-"PT 

45 CPSGEFQCRSTNRCIPETWLCDGDNDCEDGSDEE--SCTX-XT 

CPSGEFRCRXTXRCXPXTWXjCDGD1^DCEDGSXE£--SCTP"PT 

CPSSQFQCPSMRCXPETWLCDGDNDCEDGSDEK--SCTP-PT 

CQASQFTCGSG1^"CVPPPWGCDGDDDCEDGSDEE--SGTP"-PT 

CPANQFQCRSTNRCVPGMVCDGDNDCEDGSDEE--SCAT-PT 

50 cVAGQFIXiCRSTGRCVSATWVCDGVNDCEDGSDEK--SC»-PT 

CLSDEFRCRSTGRCFPVNWLGDGDNDCSDGSDEE--SCPQ-PT 

CQSDEFTCX-WGQCIPQDWVCDGEDDCGDSSDEAPAHCSQDPEFHKV 
CaPDEFTCNSTGRCIPPDWVCDGVDDCEDGSDET--GCSQDPEFHKy 
CVSSQFa:CRSTGRCIPRAWCDGDDDCEDGSDEK--GCSQDPEFHKV 

55 CQSSEFTCKSTERCIPLTWVCDGDIvIDCEDGSDEE--l^CSQDPEFHKV 
CESDEFTCKSTSRCXPEAV?VCDGDNDCEDGSDET--NCSgDPEFHKV 
CLADEFQCHSTKRCVPKHWLCDGVNDCEDGSDEK— SCSQDPEFHKV 
CPADQFQCRSTGRCIPEHWLCDGVNDCEDSSDEK— GCSQDPEFHKV 

60 

Fam9 
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CPP-DEFTCRSTERCIPLAWVCDGDKDCEDSSDEAG— CT TPEPT- 

CPP"DEFTCESTSECrPLAWVCDGDNDCEDSSDE7\G— CT TPEPT- 

CPP~DEFTCRSTERClPLAV3VCDGDl^DCEDSSDmG--CT TPEPT- 

CPP-DEFTCRSTERCXPLMVCDGONDCEDSSDEAG—CT TPEPT- 

C^P-DEFTCRSTERCIPXAWVCDGDNDCEDSSDEAG—CT TPEPT- 

CPP-DEFTCRSTSRCXPLAWCHGDNDCEDSSDEAG—CT TPEPT- 

CAP"SEFTCRSTGRCIPRTWLCDGDl<iDCEDSSDH;APALCT TPVPA- 

CPA~SEFQCHSTGRCIFASWLCDGONDCEDSSDEAG--CT TPEPT- 

CES-GEFQCHSTGRCIPASWLCDGDWDCEDSSDEAG--CT TPEPT- 

CAS-KEFKC-SSGRCLPPSWLCDGXKDCEDGSDEAN^-CT XPVFT- 

CPS-GEFQCRSTl^^RCIPETWLCDGEDDCGDSSDESLALCGRPG—PGATSAPAA 

CVS-GEFTCRSI^KRCIPVDOTCDGDHDCEDGSDEPPALCGRPG-'-PGATSAPAA 

CES-SEFQCRSTDRCLPVTOVCDGDNDCEDSSDES— 1SICGRPG--PGATSAXAA 

CHAFTQFECRSTGRCIPAAWCDGDNDCEDGSDESPAICGRPG--LGXTSAPAA 

CHAPTQFECRSTGRCIPVSWVCDGWIDCEDGSDESPAICGRPG--LGATSAPAA 

CRA-]:1QFQCHSTGRCIPXSWLCDGVNDCEDGSDESPAICGRPG--LGATSAPAA 

CEA-SEFTCRSTDRCI.PVSWVCDGVNDCEDGSDESPAICGRPG--LGATSAFA- 

CLS-NEFRCSSTGRCLPRPWCDGDNDCEDGSDEAPAICGRPG--PGATSAPAA 

CX.A-SEFTCHSTGRCIPETWVCDGTODCEDSSDE— ADCGRPG—PGATSAPAA 

CLA'-SEFTCHSTGRCIPETWCDGVNDCEDSSDEDWAHCGRPG--PGATSAPAA 

CPP-l^EFTCQSTDRCLPADWVCDGVNDCEDSSDE-ADCGRPG-'-FGATSAPAA 

CDPIAEFKCH5TGRCIPLDWLCDGDNDCEDGSDEAS--CGRPG--PGGTSAPAA 

CQSFTEFECHSTGRCIPVDWLCDGDNDCEDGSDEAS--CGRPG™-PGGTSAPAA 

CR-AHEFQCHSTGKCIPAS1^LCDGDNDCEDGSDEAS--CGRPG-"-PGGTSAPAA 

CA- PDE FPCRSTGRC VPLTV^LCDGDW DCE DGSDE AS ATCGRPG - - FGAT SAPAA 

CRA"t^EFQCHSTGRCIPETWLCDGDKDCBDGSDE— ESC TPPT- 

CRA~1SIEFQCBSTGRCIPA3WLCDGDNDCEDGSDESLEXC PQPT- 

CRA~NEFQCHSTGRCIPASWLCDGD!^DCEDGSDE— KDC KTPGPT- 

CRA-I^EFQCHSTGRCXPASWLCDGDNDCEDSSDE--KGCGDSHILPFXTPGPST 

CRA-NEFQCBSTGRCIPASl^^LCDGDNDCEDSSDE— TGC- AKPT- 

CRA~NEFQCHSTGRCIPASWLCDGDNDCEDSSDE~TGC AKPT-- 

CRA~MEFQCHSTGRCIPASWLCDGDNDCEDSSDE— TGC AKPX- 

CRA--KEFQCHSTGRCIFQTWLCDGDl^DCEDGSDE"-AGC AASGPT- 

CEA-lsIEFQCQSTGRCIPLWLCDGDNDCEDGSDE—TlNlCG TPGPT- 

CEA-SEFTCRSTDRCIPLEWLCDGDNDCEDGSDEAI^J— CG AAART- 

CQS-SSFTCKSTNRCXPJbAWLCDGVNDCEDGSDEAK—CT SPERT- 

CRS-SEFTCRSTSRCIPEtsfWLCDGWfDCEDGSDETG— CG TSAPT- 

CRS-SEFTCRSTSRCIFE1:^WLCDGV|<IDC£DGSDETG--CG TSAPT- 



FamlO 

-CQA-GQFECRSTJ^RCIPQDWVCDGVNDCEDSSDEE SCTSPART 

-CQA-GQFECRSTl^RCIPQDViJVCDGVNDCEDSSDEE SCTSPART 

-CQA-GQFECRST^KCIPQDWVCDG\^DCEDSSDEE SCTSPART 

"CQA-GQFECR3TKRCIPQDWVCDGVNDCEDSSDEE SCTSPART 

-CQA-GQFQCRSTl^RCIPQDWCDGVXDCEDSSDEE RCTSPART 

-CPA-GQFQCRSTt^RCIPQDl^VCDGV^DCEDSSDEE SCTSPART 

-CEA-r^QFRCKSTSRCXPQWWLCDGVI^DCEDSSDEE NCTKTAPT 

-CEA-DEFRCRSTGRCISVDWRGDGVSDCEDSSDEE SCESTAPT 

-CEA-GEFRCKSTDRCXPLAWRCDGVNDCEDSSDEA SCKSSAHT 

~CLA-NEFTCRSTGRCIPRTWRCDGV1NIDCEDGSDEA 

"CLA-i^EFTCRSTGRCXPRTWRCDGVi^DCEDGSDEA NCKKPT— 

-CEA-NEFRCKSTGRCXSQTWRCDGDDDCEDSSDEA l^CKPFT-- 

-CLP-SEFPCS-KGRCVPRPWVCDGDDDCEDl^lSDEA GCPKPT— 

-CEP-GEFPCSSTGRCVPVAWHCDGVl^DCEDGSDET GCQKRT-- 

"CQP-DEFRCRt;^TDXClPQRV?VCDGDNDCEDSSDEA DCQQFT--^ 

^CQA-DEFRCGN-GRCXPQRWVCDGDDDCGDGSDXX DCXTPT-- 

^CLA- DEFRCXSN!>1RCLPLDWVCDGDWDCE DS S DEK DCAXPT-- 

-CPP-DQFFC-DNGDCLPQPWVCDGEXDCPDDSDE ASCTTSVHT 

-CAA-DQFKC-DNGRCVFQNWRCDGEXDCGDigSDE— ENCTTPT-- 
SQFX-GQFKC-GNGi>iCVPRTWRCDGVNDCPD^SDE TDCFTPT-- 
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-CEA-GQFRC"i^iNGNCXPQHWLCXGEXDCED!SiSDE 

-CAP-DXFXC-X!^GKCLPLDWVCDGEDDCGD^3DE TXCQ 

~CAS~1N1QFTC-NNGHCLPQHWRCDGEDDCGDNSDE ASCQFPT-- 

~ CQA- DEFKC'-GNGRCL PE AWC DGED DCGDKSDE ADCQAPT - - 

5 "CQA- DE FRC-GNGRCI S P TWVCDGEXDCGDDS DE ANCATTERT 

™ CQP" GEFRC-RNGKC I PQTl^LXXGX DDCG S DE ADCATTAPT 

" C P P " DEFKC-GISIGHCI S QTWLCDGEXDCGDNS DE— ES CA- -APT 

"CPS-GEFRC-SNGSCXPQEWGCDGXNDCGDDSDE KNCAAAGPT 

-CPS-GEFRCQSSJ^TCIPLlslWLCDGEDDCGDDSDE KKICEASVPT 

10 -CLS-GEFRC"$NGNCLPADWLCDGEDDCGD1SISDE TSCAASEPT 

-CQP-GEFTC"15MGNCI>PLEOTCDGE1^DCGDSSDE EHCGGSEHT 

-CQS-DQFRC-SNGRCIPVEWCDGEDDCX.DGSDEP QVCGTTAPT 

-CPP-DEFEC-SHGRCLPQFWCDGEDDCGDGSDE TSCATTAPT 

-CAS-lSiQFRC-RNGRCIPLPWVCDGEDDCQDNSDE ASCAAPAPT 

15 --CAS-NQFRC-RNGRGIPLPWCDGSDDCQDNSDE ASCAAPAPT 

-CVA-DEFFCGN-GNCIPLPV^RCDGDDDCGDWSDE TDCESSXPT 

-CPP-DEFPCSNSGICIPRSWRCDGEDDCGDNSDE EDCTSAGHT 

-CAP-IsiEFPCGN-GRCIPATWLCDGD]S!DCGDNSDE EGCGGSART 

--CPP-SEFPCGN-GSCVPQAWVCDGDPDCPDNSDE EGCTGTGPT 

20 -CPP-DEFRCMN-GKCI PLSWRCDGDODCQDSSDE AGCT— ER^ 

"^CXP-^GEFQC-NWGECIPATWLCDGDDDCGDNSDE TGCTEHT— 

-CXP-GEFQC"J>iWGKCIPATWLCDGDDDCGDNSDE TGCTEHT— 

""CQS^HEFQC-MNGRCISVTWLCDGDDDCGDSSDE TDCTSAVPT 

-CPS-SEFQCRISINKTCIPRNWLCDGBDDCGDSSDE TDCTTHT— 

25 -'CVP-^GEFRCHDSGTCVPI.AXLCXGDMDCGDWSDE ASCESSEPT 

-CAP-- GQFRCKN- GRC VPLS WVCDG DDDCE DDS DE AN CE S PEPT 

-CAA"DQE'RCSS-GRCVPLTWLCDGDDDCADDSDE KDCESTAHT 

-CAA-DEFQCNSTGRCIPVSWVCDGEDDCRDDSDK ENCRSSEPT 

-CLA~GEFRCNS-GRCIFEHWKCDGEDDCLDSSDE KDCTTSEPT 

30 "CX-AXQFTC-D!>iGQCLFQNWVCDGEt^DCPDXSDS Kt^CAPHT-- 

-CX"SSXFRC"XWGXCLPLXWVCDGE1^DCGDXSDS XXC 

- CV~ ADQFRC- DNGRCL S REWCDG VNDCQDGS DE T^^ICQERT- - 

-CA-AGEFRCRDSGRCLPQHWI;CDGENDCADGSDE THCTQHT— 

~CX-PSEFTC-SSGQCIPE DV3 VCXGXl^ DCGDDS DE TMCETRT- - 

3 5 -CV-ANEFKC-GSGKCI PKTV^ VC DGDM DCGOG S DE ASCAQPT - - 

-CG-ANEFKC-SSGSCIPQEWRCDGEWDCGDNSDES—LAPCKEPT"" 

-CR-ADEFKC-GNGHCIPGQWLCDGENDCQDGSDE KSCEQPT— 

-CL-P15iQFQCQSSGRCIPLIs)WLCDGDDDCGDDSDE TSCKAPT™ 

-CP-ASEFQCGN-^GRCISEHWLCDGDtJDCGDNSDE TSCKAPVPT 

40 " CQ- ADE FQCRl^TEKCL FLMWLC DG DN DCG DDS DE TSC ATPT - - 

- CVA-S E FTCKDT DRC I PLHWVC DG VD DCG DNS DEAD CET3VBT 

"CSA"NEFRCQSTDRCIPASV(fVCDGVDDCEDGSDEKS CTTSGHT 

-CEA-SEFTCNSTGRCLPLTWVCDGVls[DCEDGSDEKS CTTSVRT 

"CAP-l^EFTCSSTGRCLPRAWVCDGVDDCEDGSDETS CGATVHT 

45 -~CGA--WEFTCQSTNRCIPQSWVCDGV1SIDCEDGSDESPV~-LCATTVHT 

-CQP-DEFRCRSTGRCLPQE^9LCDGVMPCEDSSDEAD CGTSAHT 

-CAP-GEFPCRSTGRCIPQTWVCDGVHDCEDSSDEKS CATAEHT 

50 [434] Fam 1 0 monomer domains can be further divided into siibfamilies 

(designated "lOA", lOB''^ etc,)* The following lists the consensus motifs for the various 
siibfamtilies: 

lOA CxxxEFQCNnGRCIPxxWLCDGDdDCGDxSDSTxC 
lOB CPPxEFPCxNGxCIPxxWxCDGDxDCxDNSDEEGCT 
55 IOC CxAgEFrCxxGRCiPLxWxCDGdDDCgDxSDExdCESS 
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lOD CpsGEFRCSNGxCIpqxWlCDGeDDCGDxSDExxCA 
lOE CxADEFKCGNGrCIpxxWvCDGexDCGDdSDExsC 

[435] Several consensas sequences of all Fam 1 0 subfamilies were 

generated: 

5 lOA-E CpaxEFxCxNGrCIPxxWxCDGddDCGDxSDExxC 
lOA-E CxxxEFxCxNGxCIPxxWxCDGxdDCGDxSDExxC 

[436] Thus, c-MET-binding monomer domains having an A domain scaffold 
and comprismg the sequence: 

EFXCXNGXCIPXXWXCDGXDDCGDXSDE 

1 0 axe encompassed by the present invention. 

[437] The following provides c-MET binding diiners, i,e.,polypeptides 
comprising two monomer domains, each of which bind c^MET, Tables following each 
family of dimcrs represent consensus motifs based on alignment of the family members. 
Note that "Fam" designations below refer to dimer families, which are different than the 

1 5 monomer families listed above, 

Pam 1 

CQASDQFECKSTGRCIFLAWECDGDNDCEDGSDESPAICG RPGLKASGGSCRAN- 

20 EFQCHSTGRCIPASWLCDGDI^DCEDGSDE-AS-CGRPGPGGTS— APAA 

ScHSTGRCIPASWLCDGDNDCEDSSDEAPDLCASAAPTSLQASGLEASGGSCHAPTQFECRSTGRClPMWVC 
DG Dl^l DCE DGS DE S P AI CGRPGX.GXT FAA 

25 

CESG-EFQCHSTGRCIPASWLCDGDISIDCEDGSDES-QLCT AHTCAPG- 

h;fqchstgrcipaswlrdgdwdcedgsdes-xi.cta-h>c 

CRSI^-EFTCRSTGRCIPRTWVCDGDE^DCEDGSDESPAICGDSHILPFSTPGLEASGGSCP- 
30 AGQFTCRSTNRCXPLQWCDGDNDCEDSSDEA15— CGDSBILPFSTPGPST 

CLAN-EFTCRSTGRCIPLQOTCDGDNDCEDSSDEK--GCGDSHILP GLEASXGSCX- 

XSQFXCRSTGRCIPAEWCDGDNDCEDSSDEAS— CGDSHILPFSTPGPST 

35 CASS-EFRCRSTGRCIPQRWCDGDNDCEDGSDET--NCGDSHILPFSTPGLEASGGSCQ- 
TGEFRCRSTDRCXPAEWCDGDSDCEDGSDBTN-^CGDSBILPFSTPGPST 

CASS-EFECRSTGRCXPQRWCDGDNDCEDSSDEK—GCGDSHIX.PFSTPGLEASGGSCA" 
ADQFQCRSTGRCIPRTWLCDGVt^DCEDGSDEPLALCSAPASEP PGSL 

40 

CEAS-EFTCESTiqRCXPLQWVCDGDNDCEDSSDEK— GCGDSHILPFSTPGLEASGGSCG- 
S1^IQFTCRSTKRCITATWVCDGD1^^DCEDSSDE-TD-CSAPASEP PGSB 
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15 



CGSD-EFQCKSTSRCIPLTWRCDGDSPCEDSSDEA—lSiCGR PGLEASGGSCQ- 

SGQFQCXSTGRCIPRTWVCDGDlSfDCEDSSDEK-N-CQ-P 

CES^!)-EFQCQSTSRCIPLT^^RCDGDNDCE:DSSDEK""SCSAPASEP"— PGLEMGGSCP^ 
ASEFTCRSTGKCISQGMVCDGD^DCEDSSDESPAICATTG PT 

[438] The consensus sequences below include question marks CT'). These 
indicate positions that can be present or absent 
s 

m ex 

0 1 P! motif 

ch cte 

es d 

e 

1 % 

2 ^ 10" c. [as] [eq] f, c. St. rclp.. w, cdgd. deed. sde.,?.?.c,, 

[st]c..?* .?[eq}f .c.st,rci* . .w[ilv] ,dgdndced. sde. 

6 * • — * 

1 CI, 

1 ^ 10' c. [as] , ,?[eq]f .c.st.rcip. , w. cdgd. deed. sde , .?.?,c..?.?.?.?*?.V*?^?.? 

2 ^ .?.?,?,?.?.?ggsc. .?[eq3f ,c.st .rci. . . w [ilv] cdgdndced. sde. 

I ^ 10' c\ las] ..?[eq]f .c[kr]st.rcip, .w,cdgd.dced,sde, .?.?.cg.?. 

5 4 64 7,7^7,?,7.7^7.7ggsc, . ? [ eq] f . c . st . Tci . . , VT [ llv] cdgdndc6d . sde , .?*?c 

Z ^ 10~ c[ekqr] [as] . . ? [eq] f [eq] c. stgrcip. Ea^]w. cdgdndced.sde[as] , ?. ? [ilv] c, 
73 70 9'?9'7 77 7^7.7.7, 7,7^7. 7,?. . [st]c. .?[eqlf [eqjcstgrcipa [as] w 

1 

0 10*^ c.s[dnl .?ef.c[kr] St, rcip.tw.cdgd. deed, sde [as] .?.?. eg. 

5 ^ 9,7,7,7^7,7.7,7ggsc. *?[as] .?qf .cst.rcip. -wvcdgdndcedssde, ,?»?c. . .? 

00'?7777777 7 7 7 

^ , 10" 5sd , ?ef qckstsrcipltwrcdgdadcedssdea . ? . ?ncg 

' .7, 7, 7ggscq,?s,?qfqc.stgrciprtwvcdgdndoedssdek.?.?cqp. 

6 



Fam 2 

CXAXQFTCD-NGQCLPQNWVCDGENDCPDXSDEKN— C— 
APHTCPSGEFQCRSTNRCIPETWLCDGDNDCEDGSDEESCTPPT 



CQPGEFTCl^-NGNCLPLEWVCDGENDCGDSSDEEN— 
CGGSEHTCPSGEFQCRSTNRCIPETV^LCDGDHOCEDGSDEESCTPPT 

CLAGEFRCN-SGRCIPEBWRCDGEDDCLDSSDEKD— 
20 CTTSEPTCPSGEFQCRSTiNlRCIPETWLCDGDNDCEDGSDEESCTPPT 

CPSGSFRC-S^^GSCIPQEWGCDGXWDCGDDSDEKM"- 
CAAAGPTCPSGEFQCRSTKRCXPKWLCDGDNDCEDGSDEESCTPPT 
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CPSGEFRCQSSt^TCIPLMWLCDGEDDCGDDSDEKN-- 

CEM VPTCPS GEFQCRS TNRC I PET WLCDG DN DCEDGS DSESCT PPT 

C QFRCR-NGRCI PLPWVC DGE DDCQDNS DE AS - - 
5 CAAPAPTCPSGEFQCRSTNRCIPETWLCDGDKDCEDGSDEESCTPPT 

CASlSiQFRCR-NGKCXPLPWCDGEDDCQDl<iSDEAS-- 

caapaptcpsgefqgrst^rcipie;twi,cdgdndcedgsdeesctppt 

10 casmqftcn-nghci>pqhwrcdgeddcgdl^sdeas-~cqp-- 
ptcp s gefqcrstnkc i petwlc dgdisi dcedg s dees ctxxt 

cqade frcg-isigrc i s ptwvc dgex dcg dds dean — 
cattektcpsgefqcrst^rcipefwlcdgdndcedgsdeesctppt 

15 

CPPDEFKCG--NGHCISQTWLCDGEXDCGD1SJSDEBS— GAAP— 
TCPSGEFQCRXTMRCI FETV3LC OQUH DCEDGS DEESCT PPT 

CPPDEFRCS-NGRCLPQPWVCDGEDDCGDGSDE'rs — 
20 CATTAPTCPSGEFQCRSTNRCIPETWLCDGDNDCEDGSDEESCTPPT 

CVANEFKCG-SGKCIPEWVCDGDNDCGDGSDEAS— CAQPT— 
CPSGEFQCRSTNRCIPETWLCDGDNDCEDGSDEESCTPPT 

25 CGAl^EFKCS-SGSCIPQEWRCDGEJ^DCGDl^SDESLAPCKEPT"™ 
CPSGEFQCRSTNRCIPETOLCDGDNDCEDGSDEESCTPPT 

GRADE FKCG -NGHCI PGQ1)9LCDGE1SIDCQDGSDEKS — CEQ PT - - 
CPSGEFQCRSTNRCIPETWLCDGDNDCEDGSDEESCTPPT 

30 

CVPGEFRCHDSGTCVPLAXLCXGDt3DCGDWSDEAS— 
CESSEPTCPSGEFQCRSTNRCXPETWLCDGDNDCEDGSDEESCTPFT 



mat expe 
or , ^ J motif 
ches cted 

e 

Q 

15 10"^^ |eq]f.c.?.?. ..c[ilv] c. -de, d. sde.. ^ 

15 [dn] [dn] . . [de3 . . . [de]e 

14 10'^^ c. . . Eegjf ,c.?,?. . .c[ilv] e.g. .dc.d.sde. ?c. ?cp 

82 sgBfqcr.tnrcip[ekq] twicdgdndcedgsdeesct , 

31 ^ ^ [eq]f ,c.?.?. , .c[ilv] .w,cdg, .dc.d.sd^. . -?.?c, ?,?.?Gp 

85 sgef qcr , tnrcip [ ekq] twlcdgdndcedgsdeesct . , t 

30 j2 10-^9 ^ _ [eq] f ,c.?,?, . ,c[iXvl .w, cdg . ,dc.d. sde. ?c, ?cp 

45 sgefqcr. tnrcip [ekql twlcdgdndcedgsdeesctppt 

2^ 11 10"^^ , . [eq]f ,c.?.?. . .c[ilvl . . . w. edge. dcd.sde. ?c, ?cp 

49 sgef qcr , tnreipetwlcdgdndcedgsdeesctppt 

10 10"^^ c, . . [eq]f .c.?.?, ,,c[ilv]p. .w.cdg, [dn] dc. d.sde . , ,?.?c. 
53 ?cpsgefqcrstnrcip[ekq] twlcdgdndcedgsdeesctppt 

24 p 10"'^'^ c!. . [Bq]f -c,?.?. . .c[ilvlp, ,w.cdge[dn]dc.d.sde, ..?,?c. 
35 ?cpsgefqcrstnrcipetwlcdgdndcedgsdeesctppt 
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95 ?cpsgefqcrstnrcipetwlcdgdndcedgsdeesctppt 

6 10"'^'^ [eq]f.c.?. ?ng.c[ilv]p..w[ilv]cdge[dn]dc,d,sde-..?.?c, 
87 ? , 7 . ?cpsgef qcrstnrcipetwlcdgdndcedgsdeesctppt 

5 10'''* G. , . [eq]f .c.?,?ng.c[ilv]p. .wvcdge [dnjdcd.sde. , .?.?c, 
14 ?cpsgefqcrstnrcipetwlcdgdndcedg5deesctppt 

4 10"®^ c,.[dn3[eq}f[kr]c,?.?ng.c[ilv]p..wCilv]cdge[dn]dc.d.sde.s.?.?c. 
66 ?cpsgef qcratnrcipetwlcdgdndcedgsdeesctppt 



Fam 3 

5 

cpsg-efqcrstnrciph;twlcdge-ddcgdssdeslai.cgrpgpatsapaacp- 

SGKFQCRSTiqRCIPETWX.CDGDt^DCEDGSDH;ESCTPPT 

CPSG-EFQCRSTNRCIPETWLCDGD—NDCEDGSDE ESCTPPTCP- 

10 PGFQCRSTNRCIPBTWLCDGDNDCEDGSDEESCTPPTE 

CQSFTEFECHSTGRCIPASWLCDGD— NDCEDSSDEE-- GCEAAAPTCP- 

SGFRCRXTXRC I PXTV^LCDGDt^l DCEDGSXEESCTPPTE 

15 CEAN-EFQCHSTGRCIPASWLCDGD—NDCBDGSDE SQLCTAHTCP- 

SGFQCRStisiRCIPETWLCDGDiSlDCEDGSDEES-CTPPE 

CXPG-EFQC>11NIGR-C1PATWLCD<3D--DDCGD1^SDET GCT— EHTCP- 

SGFQCRSTNRCIPETWLCDGDl^DCEDGSDEESCTPPTE 

20 

CXPG-EFQCNJqOR-CIPOTWLCDGD— DDCGDl^SDET GCT--EaTCP" 

SGFQCRSTNRCIPETWLCDGDlSiDCBDGSDEESCTPPTE 

CQSN-EFQCNNGR-CISVTWLCDGD—DDCGDSSDET DCTSAVPTCP- 

25 SGFQCRSTNRCIPETWLCDGDNDCEDGSDESSCTPPTE 

CPSS-EFQCRI:^tvfKTCXPRi<lWI.CDGE— DDCGDSSDET DCT--THTCP- 

SGFQCRSTNRCIPEWLCDGDNDCEDGSDEESCTPPTE 

30 CLPS-EFPC-SNGRCVPEPWCDGD--DDCEDNSDEA GCP--KPTCP- 

SGFQCRSTNRCXPXTWLCDGDt^DCEDGSDEESCTPPTE 

Cpps-BFPC-GNGSCVFQMVCDGD--PDCPDI^SDEE GCTGTGPTCP- 

SGFQCRSTWRCIPETWLCDGDMDCEDGSDEESCTPPTE 

35 

CLPW-gFQCQSSGRCIPLMWL.CDGD™"DDCGDDSDET SCK--APTCP- 

SGFQCRSTWRCIPETVJLCDGDNDCEDGSDEESCTPPTE 

CQAD-EFQCRNTEKCLPLNWLCDGD--NDCGDDSDET $CA--TPTCP- 

40 SGFQCRSTlSiRClPETWLCDGDl^DCEDGSDEESCTPPTE 

CQPD-EFRCRNT Dl C IPQRWCDGD" -NDCEDSS DEADCQQPTCR- 
AtilEFQCHSTGRClPETWLCDGDNDCEDGSDEES CTPPT- 
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sc ma exp 

or tch ecte motif 

e es (i 

77 10" c. , . .?[eqlf .c? ?c[ilv] . . . w[ilv] cdg [de] Ide] . 

^13 10 7 , 7 i -7^^ 7 9 [de] ?^?* 

6 ■ ■ 

? 



14 lO'' c. .?[eq]f .c? ?c[ilv] . . .w[ilv]cdg[de] do. d.sde. 

26^2 2& ?.?.?,?..?.?.?...c,.?.?g t..c dg.ldn] tde] ..,?.?. ?,?-?. 

,p. , ? 

CI 

21 10^ cT . . .?[eq]f .c? ?c[iiv] . . . vj [ilv) cdg[dej .?.? .dc ,d. sde, ?,?,?.? . 

^^11 51 7,?,?,?,,?.?.?, .tcp,?.?gf [Kqr]cr,t, rcip.twlcdgdndce. - 

e 

20 10" c' , .,?[eq]f .c? ?c[ilv] . . .w[ilv] cdg [de] .dc.d, sde * 

gglO 55 ?.?,?,?..?.?.?* .tcp.?.?gf[kqrlcr.t.rcip,twlcdgdndce. 
? , ?tppt . ? 

19^ 10" c.,?[eqlf.c.?...,?ctilv]...v^[ilv]cdg[de] .?.?,dc. d.sde-?. ?.?.?. 
9 58 ,tcp.?.?gfqcrstnrcip,twlcdgdndce,?.?.?.?.?.?.?*?t 

ppt.? 

17 10" c . ,,?[eq]f .c? ?ctilv] . . .w[ilv]cdg[de] ,?.?.dc. d,sde. 

60 ?,?,?,?c:*?.?.?*.tcp,?, ?gf qcrstnrcip . twlcdgdndce ,?.?.?.?.?.?.?.?t 

16 10" c.. . .?[eq]fqc.? ?c[ilv] . , .wlcdg[de] .?.?[dn] dc.d.sde, 

62 ,7,?,?.,?.?,?,, tcp . ? . ?gf qcr^tnrcipetwlcdgdndc^ ?tp 
pt,? 

14 10'^ c . .?[eq]fqc.? ?c[ilv] . . . wlcdg Ide] [dn] dcgd.sde, ? 

39^ ^5 .?.?,?c, tcp . ? . ?Qf qcrstnrcipetwlcdgdndce ?tp 

pt.? 

£3L 

12 10' c. , , ,?efqc.?n. [ekqr] .?c[ilv] , , , wXcdg [de] [dn] dcgd, sde ,?.?-?. ? 

,?,?,?.?c. ?,?.?». tcp . ? , ?gf qcrstnrcipetwlcdgdndce .?.?,?.?.?,?-?.? 
tppt,? 

10 10" c, . .?efqc,?n.[ekqr] .?clilvlp, ,wXcdg Jde] .?.? [dn] dcgd.sde ,?.?.?. ? 
05 4 ?c. ?,?.?.. top .? . ?gf qcrstnrcipetwlcdgdndce .?.?.?.?.?.?.?.? 

tppt.? 

'^^3 ^.,,?efqc.?ngr,?ci..twlcdgd.?.?ddcgd.5de.?.?.?.?.?.?.?^?c.?.?.? 
3 . .tcp,?.?gfqcrstnrcipetwlcdgdndce,?.?.?.?.?.?.?.?tppt,? 

Fam 4 

CQPi^EFQCHSTGRClE^ASV^LCDGDNDCEDSSDESPANCATPTHTCPASEFQCHSTGRCIPASWLCDGDMDCEDSS 
5 DEAG"™CTTPEPT 

CAPGQFRCK-MGRCVPLSWCDGDDDCEDDSOE— 

A13CBSPEPTCESGEFQCHSTGRCIFASWLCDGDNDCEDSSDEAG— CTTPEPT 
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CQSDQFRCSN-GRCXPVEWVCDGEDDCLDGSDEP- 

QVCGTTAPTCA^^^DSFQCNSTGRCIPVSWVCDGVNDCEDSSDES.G—CATSGP'r 

CQADEFKCGN-GRCLPEMVCDGEDDCGDl^SDE 

5 ADCQAPTCAADEFQCNSTGRCIPVSWVCDGXKDCEDSSDEAG— CATSGPT 

CPPDEFPCSNSGXCIPRSWRCDGEDDCGDN$DEE"D- 
CTSAGHTCAPSEFTC^STGRCXPQEWVCDGDt^lDCEDSSDEAPDLCASAAPT 

1 0 CQPGEFRCRN -GKC I PQTWLXXGXDOC GDt^ SDE— 

ADCATTAPTCPPDEFTCRSTERCIPLAWVCDGDWDCEDSSDEAG— CTTPEPT 

CLSGEFRCSN-GNCLPADWLCDGEDDCGDNSDE-- 

TSCAASEPTCPPDEFTCKSTE;RCIPLAV?VCDGD^DCEDS$DEAG— CTTPEFT 
CGSSEFQCHSTGRCIPENWCDGDDDCEDSSOE-- 

KSCTSAAPTCPPDEFTCRSTERCIPLAWVCDGDNDCEDSSPEAG—CTTPEPT 

CAADQFKCDI:3"GRCVPQNWRCDGSXDCGD1^SDE— ENCTT— 
20 ptCPPDEFTCRSTEECIPLAWVCDGDNDCEDSSDEAG— CTTPEPT 

sc mat exp 

or che ecte motif 

e s d 

18 g 10'^^ r . leq]f.c..?.?g.c[ilv]p..w, ..g..dc,d. sde. tc. 
79 f.c.st.rcip. ,w[ilv]cdg.ndcedssdea.,?,?c[ast] t^t] . .pt 

1'^ 8 lO^^^c^^ teqlf.c. .?.?g.c[ilvlp,.v^.cdg[de].dc.d.sde.?,?.'>^?..*^?^?.tc. 
76 _ef .c.st.rcip. .w[ilvl cdg.ndcedssdea. ?c:[ast] [st] . .pt 

16 „ |Q^63 ^ [eq]f [kqrjc, , ? , ?g.c [ilv)p. .w.cdg[de3 ,dc .d. sde ^ - ? 
28 /tc. . .ef .c.st.rcip. [as] w[ilvlcdg.ndcedssdeag. ?,?cta5t]t , ,pt 

6 W^^o [eq]f lkqr]c..?.?g. €[ilv]p,.w...g., dcd.sde. ?c. [ast],?. 

51 ? . tc , . , af . c , St . rcip . [as ] w [ilv] cdgdndcedssdeag , ? . ?cttpept 

12 5 lO^^^c ..[eq]f [kqr]c..?,?g.c[llv]p..w.cdg[de] .dc.d.sde.? . ? . ?.?c. [ast] 
76 , ? , ? . tc. . , ef . c . St , reap . tas] w [ilv| cdgdndcedssdeag . ? , ?cttp8pt 

a 

1^ 4 10-^^r!..[eq]f [kqr]c..?,?g. c[xXv]p..w...9.. dcd.sde. ?c[astltasfc 
20 ] ^ ■? , ?ptcppdef tcrsterciplawvcdgdndcedssdeag . ? . ?cttpept 

3 10"^^ c. las] . [eq]f[kqr]c, .?.?g.c[ilv]p. [dn] cdg [de] .dc, d, sde ?c 
8 [asti [a^t] ,?.?ptcppdeltcrsterciplawvcdgdndcedssdeag,?,?cttpept 

^2 2 lO^^^c.Us] . [eqlf[kqr]c. ,?.?grc[iv]p[eq]nw.cdg[de] .dcd, 3de ?ct 
2 [St] . ? . ?ptGppdef tcrsterciplawvcdgdBdcedssdeag . ? . ?cttpept 



25 T? tX£tixi i3l& XI 

[439] Monomers or multimers that bind human IgG and/or IgGs from other 
species, incliading cynomolgus monkey IgG (referred to collectively as IgG) were identified 
essentially by the methods described in Example 7. 
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[440] The foUowmg IgG-binding monomers were identified. Tables 
following each family of dimers represent consensiis motifs based on alignment of the family 
members. 

CASGQFQCRSTSICVPMV^RCDGVPDCPD^SDEK—SCEPP T 

CASGQFOCRSTSlCVPl^WWRCDGVPDCVDi;!SDET~SCTST VET - 

CASGQFQCRSTSXCVPMl^RCDGVPDCi^DGSDKK— DCQQH T — - 

CASGQFQCRSTSICVPHOTRCDGV1^DCGDGSDEA--DCGRPGPGMSAFAA-- 

CASGQFQCRSTSlCVPHWWRCDGVE^DCLDSSDEK—SCmP ASEPPGSL 

CASGQFQCRSTSlCVPMWRCDGVPDCRDGSDEAPAHCSiVP MEPPGSX. 

CASGQFQCRSTSICVPQ^WVGDGVPDCRDGSDEP-EQCTPP T 

CLSSQFRCRDTGICVPQ^?WCDGVPDCGDGSDEKG--CGRT GHT 

CLSSQFRCRDTGXCVPQWWVCDGVPDCRDGSDEAAV-CGRP GHT - 

CLSSQFRCRDTGICVPQWWVCDGVFDCRDGSDEAPAHCSAP ASEPPGSL 

scor match expeete ^^^^ 
e es d 

10 10^^'^ c.s.qf [kqrlcr.t. icvp.ww.cdgv.de. d,sde,.?.?.?c 

5 9 7 *? , ? , ? 



9 7 



9 10'^^ c.s.qf [kqr]cr.t.icvp.ww.cdgvpdc.d.sde. 
6 ^ '3 ^ ? , ? , ? 



942 7 10"^^ casgqfqcrstsicvp.ww,cdgv.dc,d.sde 

^ 7 '? , ? 



866 6 10"^*^ casgqfqcrstsicvpmwwrcdgv 

9 9.99 



.dc.d^sde. ? .?c 



Fam2 

CGAS-EFTCRSSSRCIPQAWVCDGEI3DCRD^^1SDE--ADCSAPASEPPGSL 
CRSN-EFTCRSSERCIPLAWCDGDNDCRPDSDE--ANCSAPASEPPGSL 
CVSb3-EFQCRGTRRCIPETWLCDGLPDCGDNSDEAPAilCSAPASEPPGSL 
CHPTGQFRCRSSGRCVSPTWVCDGDWDCGD^^SDE— Et^CSAPASEPPGSL 
CQAG-EF0C-GNGRCISPAOTCDGE>3DCRDGSDE™-A^3CSAPASEPPGSL 

scor match expeete ^^^j^ 
e es d 

J ^0-3. ^...,jeq]f.c.?...rc[iv]..tast]w[nv]cdg..dc.d.sde.?.?.[dn]c 

sapaseppgsl 

584 4 10"'' cl^,..?[eq3f.c.?.. .rc[iv] . . [ast] wvcdg [de] ndc.d. sde . ? , ? . [da]cs 
apaseppqsl 

480 3 10-^' ^[as].,?sf.c.?...rci..awvcdgtde]ndcrd.sde,?.?a[dn]csapasepp 

gsl 

14411 A motif summarizing the Family 2 IgG binding A domain monomers 

follows: 
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teql f xcrx [st] xrcliv] jcxxwCilv] cdgxxdcxdtdn] sde 

CPPSQFTCKSNDKCIPVHWLCDGDNDCGDSSDE— ANCGRPGPGMSAPAA 
5 CPSGEFPCRSSGRCI PLAWLCDGDD^DCRDNSDEPPALCGRPGPGATSAPAA 

CAPSEFQCRSSGRCIPLPWVCDGEDDCRDGSDES-AVCGAPAP~T 

CQASBFTCKSSGRCIPQEWX>CDGEDDCRDSSDE— lOilCQQPT 

CLSSEFQCQSSGRCIPLAWVCDGD]>iDCRDDSDE— KSCKPRT 

score mJttches expected motif 



■J 9 



10 

(442] Two motifs summarizing tihe Family 3 IgG binding A domam 
monomers follow; 

CXS SGRC X PXX WCDGXXDCRDXS DE 
CXSSGRCIPXXWLCDGXXDCRDXSDE 

15 

{443] Based on family 3 aligmnents, the invention provides polypeptides 
comprising non^naturally occnmng monomer domains that bind IgG and lhat has the 
sequence SSGR immediately preceding the third cysteine In an A domain scaffold. 

20 

Fam4 

CPA^EFQCSNGKCISPAWLCDGENDCVDGSDE^-KGCTPRT 
CPFSEFQCGNGRCISPAWLCDGDt^DCVDGSDE-~TlslCTTSGPT 

CP PGEFQCGNGHC I SAGWVC DGEN DCVDDS DE- - KDCP ART 
25 CGSGEFQCSNGKCISLGWVCDGEDDCPDGSDE—TWCGDSHXLPFSTPGPST 
CPADEFTCGNGRCISPAWCDGEPDCRDGSDE-AAVCETHT 
CPSi<iEFTCGKiGRCISLAWLCDGSPDCRDSSDKSLAICSQDPEFHKV 

score matches expected motif 

5 iQ^"^^ cp. .ef .c.ngrcis. *w[ilvlcdgtde] .dc, d. sde 

452 4 10'^'^ cp. .ef .cgngrcis..w[llv]cdglde] . dc.d. sde . 

3gy 3 10^^^ cp. .efqc.ngrcls. .w[ilvlcdgldelndcvd.sdev?.?. -c. 



30 
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Example 10 

(444] This example illystrates binding affinities of IgG-bmdmg monomers 
for IgG from vario-us animat species. 





IgG AlBnity (i)M) by Species 


Domain 


Human 


CynoHiolgus 
macaque 


Mouse 


Rat 


IgM02 


10.7 


8.7 


23.8 


47.7 


M56 


95.5 


585 


1326 


1513 


RM09 


3900 


8900 


5300 


5750 


RM15 


812 


5630 


6300 


>10000 


502 


336 


1850 


>10000 


>10000 



Table: Affinity of IgG-binding domains by species 

[445] 0.2 ug of whole IgG fraction from the indicated species was 

immobilized in duplicate wells of a 96-well Maxisoip plate (Nunc) and blocked with 1% 
BSA. Serial dilutions of purified damsons were then added, and the araomt of bound protein 
10 was quantitated via an HRP-conjugated, hi^-affinity anti-HA secondary antibody using 

standard ELISA methods. The data was fit to a 1 :1 binding model using a non-linear best fit 
algorithm to determine the Kd (affinity). 

Example 11 

2 5 [446] This example describes an experiment designed to illustrate 

pharmacokinetic half-life conferred on a multimer by the presence of an IgG-binding Avimer 
domain, Ig-M02. 

[447] Avimer construct C242 is a trimer of Avimer domains (-15 kDa), The 
N-temsinal Avimer domain is Ig-M02. Three Cynoraolgus macaques were injected with a 
20 single 1 mg/lcg dose of Avimer C242 which had been trace-labeled with '^^I. Monkeys 1 and 
2 received intravenous doses; monkey 3 received an intramuscular dose. Senrai samples 
were obtained and assessed for '^^I cpm at the times indicated in Figure 10, out to 288 hr. 

[448] The observed terminal serum half-life in this experiment is -53 hr, 
which allometrically scales to a predicted -106 hr half-life in humans. In a similar 
25 experiment in mice, a 7-9 hr terminal half-life was observed, consistent with the rodent's 
smaller size. Furthermore, as the plots for the i.m.-injected and i.v.-injected animals 
converge and become nearly identical from about 12 hr post-injecHon onward, we infer that 
the monomer exhibits high bioavailability in vivo. 
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[449] 53 hr is significantly longer than expected for a protein of this size. 
For example, the semm half-life of the 20 WDa cytokine interleukin-S in marmosets is 4-6 hr 
(Ryffel, B. et al. Blood 83, 2093-102 (1994)), Thiis the IgG-^binding Ig"M02 domain confers 
a half-life sufficiently long to allow at least once weekly dosing in hitman subjects. 

[450] While the foregoing invention has been described in some detail for 
pxirposes of clarity and understanding, it will be clear to one skilled in the art from a reading 
of this disclosme that various changes in foim and detail can be made without departing from 
the true scope of the invention. For example, all the techniques, methods, compositions, 
apparatus and systems described above can be used in various combinations. All 
publications^ patents, patent applications, or other documents cited in this application are 
incorporated by reference in their entirety for all purposes to the same extent as if each 
individual publication, patent, patent application, or other document were individually 
indicated to be incorporated by reference for all purposes. 



120 



wo immmnss 



SEQUENCE LISTING 



PCT/US2005/021558 



SEQ ID NO:l - c-MET Amino add sequence 

1 mkapaviapg ilvilftlvq rsngeckeal aksemnvnmk yqlpnftaet piqnvilheh 

61 hiflgatnyi yvlneedlqk vaeyktgpvl ehpdcfpcqd csskanlsgg vwkdnimnal 

121 vvdtyyddql iscgsvnrgt cqrhvfphnh tadiqsevhc ifspqieeps qcpdcvvsal 

181 gakvlssvkd rfinffvgnt inssyfpdhp Ihsi^vrrlk etkdgfmflt dqsyidvlpe 

241 frdayplkyv hafesnnfiy fltvqretld aqtfhtriir fcsinsglhs ymemplecil 

301 tekrkkrstk kevfnilqaa yvskpgaqXa rqigaslndd ilfgvfaqsk pdsaepmdrs 

361 amcafpikyv ndffnkivnk nnvrclqhfy gpnhehcfnr tllrnasgce arrdeyrtef 

421 ttalqrvdlf ragqfsevllt sistfikgdl tianlgtseg rfmqvvvsrs gpstphvnfl 

481 Idshpvspev ivehtlnqng ytlvitgkki tkiplnglgc rhfqacsqcl sappfvqcgw 

541 chdkcyrsee clsgtwtqqi cXpaiykvfp naapleggtr Iticgwdfgf rrnnkfdlkk 

601 trvlXgne^c tltlsestmn tlkctvgpam nkhfnmsiii. snghgttqys tfsyvdpvit 

661 slspkygpma ggtlltltgn ylnsgnsrhi siggktctlk svsnsilecy tpaqtistef 

721 avklkidlan retsifsyre dpivyeihpt ksfiaggsti tgvgknXnav svprmvinvh 

781 eagrnftvac qhrsnseiic cttpslqqln IqXpXktkaf fmXdgilsky fdXiyvhnpv 

841 fkpfekpvmi smgnenvlei kgndidpeav kgevXkvgnk scenihXhse avXctvpndl 

901 IklnseXnie wkqaisstvl gkvivqpdqn ftgXiagvvs istallXXXg ffXwXkkrkq 

961 ikdXgseivr ydarvhtphl drXvsarsvs pttemvsnes vdyratfped qfpnssqngs 

1021 crqvqypXtd mspiXtsgds disspXXqnt vhidXsaXnp elvqavqhvv igpsslivhf 

1081 nevigrghfg cvyhgtlldn dgkkihcavk sXnritdige vsqfXtegii mkdfshpnvX 

1141 sXXgicXrse gspXvvlpym khgdlrnfir nethnptvkd ligfgXqvak gmkylaskkf 

1201 vhrdXaarnc mldekftvkv adfglardmy dkeyysvhnk tgaklpvkwm aXeslqtqkf 

1261 ttksdvwsfg- vlXweXmtrg appypdvntf ditvyllqgr rlXqpeycpd plyevmXkcw 

1321 hpkaemrpsf salvsrisai fstfigehyv hvnatyvnvk cvapypsXls sednaddevd 

1331 trpasfwets 
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WHAT IS CLAIMED IS: 

1 1 . A polypeptide comprismg a monomer domam tiiat binds to c-MET, 

2 wherein the monomer dotnain: 

3 is a lion-naturally-occuning monomer domain comprising 30 to 50 amino 

4 acids; and 

5 comprises at least one disulfide bond. 

1 2. The polypeptide of claim 1 , wherein the monomer domain is an LDL 

2 receptor class A monomer domaia 

1 3 , The polypeptide of claim 1 , wherein the monomer domain is an LDL 

2 receptor class A monomer domain comprising the following sequence: 

3 EFXCXNGXCIPXXWXCDGXDDCGDXSDE, 

4 wherein X is any amino acid. 

1 4. The polypeptide of claim 1 , wherein the polypeptide comprises at least 

2 one and no more than six monomer domains that bind c-MBT. 

1 5, The polypeptide of claim 1 , wherein the polypeptide comprises at least 

2 two monomer domains that bind c~MET, 

1 6. The polypeptide of claim 1, wherein the polypeptide further comprises 

2 a second monomer domain, wherein the second monomer domain has a binding specificity 

3 for a blood factor, thereby inaeasing the serum half-life of the polypeptide when the 

4 polypeptide is injected into an animal compared to the serum half-Hfe of a polypeptide 

5 lacking the blood factor-binding monomer domain. 

1 7, The polypeptide of claim 6, wherein the blood factor is serum albumin, 

2 an immunoglobulin or an erythrocyte, 

1 8. The polypeptide of claim 6, wherein the second monomer domain 

2 binds to immunoglobulin (IgG) and the second monomer domain is an LDL receptor class A 

3 monomer domain comprising a sequence selected from the following: 

4 CXSSGRCIPXXWVCDGXXDCRDXSDE, and 

5 CXSSGRCrPXXWLCDGXXDCRDXSDE, 

6 wherein X is any amino acid. 
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1 9. The polypeptide of claim 6, wherein (he second monomer domain 

2 binds to immitnoglobulin (IgG) and the second monomer domain is an LDL receptor class A 

3 monomer domain comprising the following sequence: 

4 [EQ3FXCRX[ST]XRC[IV]XXXW[ILV]CDGXXDCXD[DN]SDE, 

5 wherein X is any amino acid and amino acids in brackets are alternative amino 

6 acids at a single position. 

1 1 0. The polypeptide of claim 6, wherein the second monomer domain 

2 comprises CHPTGQFRCRSSGRCVSPTWVCDGDNDCGDNSDEENC, 

1 1 L The polypeptide of claim 1 , wherein the monomer domains are each 

2 between 35 to 45 amino acids. 

1 1 2. The polypeptide of claim 1 , wherein each monomer domain comprises 

2 three disulfide bonds. 

1 1 3 . The polypeptide of claim 1 , wherein the monomer domain is a non- 

2 naturally occurring LDL-receptor class A domain. 

1 14. The polypeptide of claim 1 , wherein the monomer comprises an amino 

2 acid sequence in which: 



3 at least 1 0% of the amino acids in the sequence are cysteine; and/ or 

4 at least 25% of the amino acids are non^naturally-occurring amino acids. 

1 1 5, A method for identifying a polypeptide that binds to c-MET, the 

2 method comprising, 

3 screening a library of polypeptides for aflSnity to c-MET; and 

4 selecting a polypeptide comprising at least one monomer domain that binds to 

5 c-MET, wherein the monomer domain: 

6 has between 30-100 amino acids; 

7 is a non-naturally-occurring monomer domain; 

8 comprises at least one disulfide bond, 

1 16. The method of claim 1 5, wherein the selecting step comprises selecting 

2 a polypeptide that reduces HGF-mediated cell proliferation and/or migration. 
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1 17. The method of claim 15, ferther comprising selecting a polypeptide 

2 that inhibits tumor growth in an animal. 

1 18. The method of claim 1 5, wherein ttie monomer comprises aa amino 

2 acid sequence in which: 

3 at least 1 0% of the amino acids in the sequence are cysteine; and/or 

4 at least 25% of the amino acids are non-natorally-occnrring amino acids, 

1 1 9. The method of claim 1 5, further comprising linking the monomer 

2 domain in the selected polypeptide to a second monomer domain to form a library of 

3 mummers, each multimer comprising at least two monomer domains; 

4 screening the library of midtimers for the ability to bind to c-MET; and 

5 selecting a multimer that binds c-MET. 

1 20. The method of claim 1 5, further comprising liddng the monomer 

2 domain in the selected polypeptide to a second monomer domain to form a library of 

3 mnltimers, each mnltimer comprising at least two monomer domains; 

4 screening the library of mnltimers for the ability to bind to a target molecule 

5 other than the c-MET; and 

6 selecting a multimer that binds to the target molecule. 

1 21. The method of claim 1 5, further comprising a step of mutating at least 

2 one monomer domain, thereby providing a library comprising mutated monomer domains, 

1 22, The method of claim 15, wherein Hie library of monomer domains is 

2 expressed as a phage display, ribosome display or cell surface display, 

1 23. The method of claim 15, wherein the polypeptide comprises at least 

2 two monomer domains and the monomer domains are linked by a linker. 

1 24, The method of claian 1 5, wherein the monomer domains are each 

2 between 35 to 45 amino acids. 

1 25 . The method of claim 1 5, wherein each monomer domain comprises 

2 three disulfide bonds. 
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1 26. The method of claim 15, wherein at least one of the monomer domains 

2 is a LDL-receptor class A domain. 

1 27 , The method of claim 1 5 , wherein the monomer domain comprises an 

2 amino acid sequence in which: 

3 at least 1 0% of the amino acids in the sequence are cysteine; and/ or 

4 at least 25% of the amino acids are non-naturally-occnmng amino acids. 

1 28. A polynucleotide encoding the polypeptide of claim 1 , 

1 29. A polypeptide comprising a monomer domain that binds to 

2 immunoglobnlin-G (IgG), wherein the monomer domain is an LDL receptor class A 

3 monomer domain comprising sequence selected from the fcllowmg: 

4 CXSSGRCDPXXWVCDGXXDCRDXSDE, 

5 CXSSGRCIPXXWLCDGXXDCRDXSDE, and 

6 [EQ]FXCRX[ST]XRC[IV]XXXW[ILV]CDGXXDCXD[DN]SDE 

7 wherein X is any amino acid and amino acids in brackets are alternative amino 

8 acids at a single position; and 

9 wherein the polypeptide has an increased serum half-life when the polypeptide 

1 0 is injected into an animal compared to tbe serum half-life of a polypeptide lacking the 

1 1 monomer domain that binds to IgG. 

1 30. The polypeptide of claim 29, wherein the monomer domain comprises 

2 CHPTGQFRCRSSGRCVSPTWVCDGDNDCGDNSDEENCSAPASEPPGSL 

1 31. The polypeptide of claim 29^ further comprising a second monomer 

2 domain with binding specificity for a molecule other than IgG, wherem the second monomer 

3 domain: 

4 has between 30-100 amino acids; 

5 is a non-naturally-occurring monomer domain; 

6 comprises at least one disulfide bond, 

1 32. The polypeptide of claim 31, wherein the second monomer domain is a 

2 non-naturally-occuring LDL-receptor class A domain. 

1 33. A polynucleotide that encodes the polypeptide of claim 29. 
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